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Sir: 

I, Haruki OKAMURA, declare and state as follows: 

1. I am a citizen of Japan residing at Nakahozumi, 2-12-32, 
Ibaraki-shi, Osaka, Japan. 

2. I have graduated from Osaka University, Faculty of Science 
(Biology) and received a doctorate of Science at Osaka 
University in 1976. 

3. I have been working for Hyogo College of Medicine since 
1976. I am the professor, Laboratory of Host Defenses, 
Institute for Advanced Medical Science, Hyogo College of 
Medicine . 

4. I have been engaged in research in the field of 
microorganisms and cytokines, particularly, interleukin-18 
(IL-18). A true copy of my curriculum vitae is attached hereto. 

5. I am one of the inventors on the above-identified patent 
application and am thoroughly familiar with the specification. 

6. I and other co-inventors of the present invention screened 
a protein which induces interferon- y (IFN-y) production by 
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immunocompetent: cells. As a result, I and other co-inventors 
first discovered a novel protein, i.e., interleukin 18 (IL-18), 
from mouse and then revealed its amino acid sequence and 
nucleotide sequence; an amino acid sequence of SEQ ID NO: 3 and 
a nucleotide sequence of SEQ ID NO: 4 in the original 
specification of the above-identified patent application (called 
"the specification", hereinafter), respectively. 

7. At the time the invention was made, once the above 
information were available, then the skilled person in the art 
could have easily understood and engineered various variants of 
the protein with the amino acid sequence of SEQ ID N0:3, i.e., 
a variety of polypeptides which are homologous to the protein, 
where one or more amino acids in SEQ ID NO: 3 are replaced with 
other amino acids, one or more amino acids are added to the 
amino acid sequence of SEQ ID NO: 3, particularly, to the N- or 
C-terminal region in SEQ ID NO: 3; and/or one or more amino acids 
are deleted from the amino acid sequence of SEQ ID NO: 3, 
particularly, from the N- or C-terminal region of SEQ ID NO: 3, 
without altering the inherent biological properties of the 
protein with the amino acid sequence of SEQ ID NO: 3. 

8. Based on these, I and other co-inventors disclosed such 
variants at page 9, 3rd paragraph and page 15, 3rd paragraph 
through page 16, 1st paragraph in the specification. 

9. As regards such variants, they should not be limited to 
specific ones as long as they are the sequence variants of the 
protein with the amino acid sequence of SEQ ID NO: 3 and have 
substantially the same biological properties as the protein, 
especially, an activity of inducing interferon- y production by 
immuno-competent cells and activating the cytotoxicity of killer 
cells, where the activation is augmented by interleukin 2. 

10. The specification is silent about the specific homology of 
the above-identified variants to the protein with the amino acid 
sequence of SEQ ID NO: 3. However, I and other co-inventors had 
in mind that such variants are those which are at least 90% 
homologous to but different from the protein with the amino acid 
sequence of SEQ ID NO: 3, based on the previous publications 
which show the percentages of homology among interleukins (ILs) 
of animals, particularly, rodents such as mice and rats. 
Examples of such publications are listed in "16. References", 
i.e., references (a) to (f). 
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11. In the specification, I and other co-inventors disclosed 
no substantial concrete example of such variants but they were 
surely within our envisaged scope, because I and other co- 
inventors had recognized at the time the invention was made a 
person skilled in the art could have easily understood and 
engineered such variants based on the disclosure of the 
specification, as well as the above-identified publications and 
conventional recombinant DNA technology as shown, for example, 
in "Recombinant DNA" , Second Edition, Chapters 11 and 23, edited 
by James D. Watson et al . , published by W. H. Freeman and 
Company, 1992 . 

12. Besides, for example, US Patent Nos. 4,798,885 and 
5,494,792, which were filed at the USPTO before the filing date 
of the present application and which claim for proteins, 
polypeptides, arid their sequence variants with the desired 
biological activity, for example, at least 90% or 95% homologous 
to the specific amino acid sequence of any one of such proteins 
and polypeptides, were patented without substantial concrete 
disclosure of amino acid sequences of such variants in the 
above-identified specifications . 

13. This means that, even at the time or before 1993 at which 
the above US Patents were made, the person skilled in the art 
to which it pertains could have obtained variants of any 
proteins and polypeptides, if only the amino acid sequence of 
such proteins and polypeptides were once revealed. 

14. Under these conditions, I believe that the specification 
has been sufficiently disclosed in such a manner that it does 
surely enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make the 
invention commensurate in scope with the claims for variants 
which are at least 90% homologous to but different from the 
protein with the amino acid sequence of SEQ ID NO: 3. 

15. I hereby further declare that all statements made herein 
of my own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or 
imprisonment, or both, under 18 U.S.C. 1001, and that such 
willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 
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16. References: 

(a) Nucleic Acids Research, D. R. Cohen, A, J. Hapel 
et al., Vol. 14, No. 9, pp. 3641-3658, 1986; 

(b) J. Biochem. , Tsutomu Nishida et al., Vol. 105, pp. 
351-357, 1989; 

(c) Cytokine, Klaus Uberla et al., Vol. 3, No. 1, pp. 72- 
81, 1991; 

(d) The Journal of Biological Chemistry, Wolfgang 
Northemann et al., Vol. 264, No. 27, pp. 16072-16082, 
1989; 

(e) Biochemical and Biophysical Research Communications, 
Fadi G. Lakkis et al . , Vol. 197, No. 2, pp. 612-618, 
1993; 

(f ) Biochemical and Biophysical Research Communications, 
Richard E. Goodman et al.. Vol. 189, No. 1, pp. 1-7, 
1992; 

(g) "Recombinant DNA" , Second Edition, Chapters 11 and 23, 
James D. Watson et al., published by W. H. 
Freeman and Company, 1992; 

(h) US Patent No. 4,798,885; and 

(i) US Patent No. 5,494,792. 
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ABSTRACT 

Genomic clones carrying the rat interleukin-3 (IL-3) gene have been 
isolated and the nucleotide sequence of the gene ' determined. Alignment of 
this sequence with that of the mouse IL-3 gene has allowed the structure of 
the rat IL-3 gene to be deduced. The intron-exon boundaries are conserved and 
extensive nucleotide homology (approx 90%) is present in the 5 1 flanking 
region and the portion of the gene coding for the signal peptide* Several 
proposed regulatory sequences are conserved and an analogous element to the 
tandem repeat in intron 2 of the mouse gene is also present. ,The predicted 
amino acid, sequence., mature .rat IL-3. shows surprisingly, low homology (54%) 
with its murine .counterpart, although all four cysteine residues are con- 
served. The rat IL-3 gene was expressed in monkey C0S-1 cells and colony 
assays established that rat IL-3 is a multi-lineage haemopoietic growth regu- 
lator. There was little cross-reactivity of the respective IL-3 species on 
mouse and rat bone marrow cells suggesting that rat IL-3, in concert with its 
receptor, has evolved significantly away from the mouse IL-3/receptor system. 

INTRODUCTION 

Interleukin-3 (IL-3) is a member of the family of colony 
stimulating factors believed to regulate haemopoiesis^"^. It is 
involved in regulating growth and differentiation of pluripotent 
stem cells leading to the production of all the major blood cell 
types.. The major natural source of IL-3 is the antigen-stimu- 
lated T lymphocyte. although it is also produced by a number of 
continuous cell lines^ - ^. IL-3 has been extensively character- 
ized in the murine system, and cDNA^»^ and genomic®'^ clones have 
been reported recently. Little is known about IL-3 species in 
other mammals. There has been some biological characterization 
of mucosal mast cell growth factor in rats which may be equiva- 
lent to IL-3 10-13 . Pluripotent stem cell factor activity has 
been reported to be produced by the human bladder carcinoma cell 
line 5637*^, but it is still not established whether an exactly 
analogous lymphokine to murine IL-3 exists in man. Southern 
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hybridization analysis of mammalian DNA * s , using a murine IL-3 
cDNA probe, fails to detect homologous sequences in most 
mammalian species, even under conditions of relatively low strin- 
gency (unpublished data). This apparent low conservation of 
mammalian IL-3 genes contrasts with genes for other lymphokines 

such as IL-2, GM-CSF and Y-interf eron which cross-hybridize be- 

15-1 7 

tween mouse and human 

As part of a study of the structure and biological role of 
IL-3 in other mammals, and in view of the possibility of studying 
the role of IL-3 using rat experimental models, we felt it was 
desirable to isolate the rat IL-3 gene and express it in animal 
cells. This is described in the present work together with the 
determination of some of the biological properties of recombinant 
rat IL-3. 

MATERIALS AND METHODS 

General Methods 

(PVGxDA) male laboratory rats were used as the source of DNA 

for all experiments. Genomic DNA was prepared by homogenizat ion 

and treatment with sarkosyl and proteinase K, followed by centri- 

fugation in CsCl/et hidium bromide gradients^' ^. Large scale 

A DNA preparations were made from 1 litre cultures in Luria 

broth, lysed after 6 hr aeration at 37°C. Following addition of 

20 

CHCI3, phage DNA was purified as described . Small scale 
preparations of X DNA from plate lysates on Luria + Mg agarose 
plates were as described , and plasmid DNA mini pr epar a t ions were 

made by the rapid boiling method . Large scale plasmid prepara- 

2 2 

tions were carried out according to Clewell and Helinski and 
plasmid DNA was purified using CsCl /ethidium bromide density 
gradients . 

DNA fragments were isolated by e 1 ec t rophore t ic separation on 
low-melting point agarose gels. Transformations were carried out 
using the high efficiency method of Hanahan. 

The primers used for probe preparation and sequencing were 
synthesized by the phosphoramid i t e method. Autoradiography was 
carried out at -70°C using Fuji X-ray or Kodak XAR-5 film with 
Dupont Lightning Plus intensifying screens. 
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Construction and Screening of the Rat Genomic Library 

Total genomic DNA from rat liver was partially digested with 

Sau 3A and fragments in the size range 9-20kb were purified from 

low-melting point agarose. These fragments were ligated with 

X EMBL3A arms prepared by Eco RI/ Bam HI digestion of the phage DNA, 

and then packaged into bacteriophage particles 21,2 *. Percentage 

25 

recombinants was determined by spi selection 

The library was plated out for screening (using E . coli 
strain ED8655) on L top agarose/L agar in glass baking dishes 
(20cm x 30cm), approx 250,000 pfu per dish. Nitrocellulose 
replicas of plates containing plaques were prepared for hybridi- 
zation as described by Benton and Davis 

Filters were routinely pre-washed in 50 mM Tris-HCl pH8/lM 
NaCl / lmMEDTA/0 . 1% SDS at 42°C for 1-2 hr, pre-hybridized in 
6xSSC/5xDenhardt *s solution/ lOmM EDTA/0. 5% SDS/50ug/ml salmon 
sperm DNA at 65°C for 1-4 hr and hybridized in the same solution 
plus probe at 65°C for 18 hr. Filters were rinsed in 

5xSSC/0. lXSDS and then washed twice in 2xSSC/0 . 1%SDS at 65°C for 
45 rain. 

Probe Preparation 

The 467 bp Hind III - Ncol fragment of the murine IL-3 cDNA- 
containing plasmid pILM3° was isolated and used as a template for 
primed synthesis of radioacti vely labelled cDNA, using a 
synthetic random decamer as primer. Probe preparation was as 
described**, except that synthesized probe was routinely separated 
from un-incor porated label and very small labelled fragments by 
chromatography on Sephadex G-50 (fine). 
Southern Hybridization 

Restriction endonuclease digests of total genomic DNA (15yg 
per lane) or recombinant bacteriophage DNA (0.5-1 ug per lane) 
were elect rophoresed on 1% agarose/TAE (40mMTris-aceta te/ ImM 
EDTA) gels at 25mA for 16 hr. The gels were treated and blotted 
to nitrocellulose as described . Pre-washing, pre-hy brid ization 
and hybridization of filters was as for genomic library 
screening. Following hybridization, filters were rinsed briefly 
in 5xSSC/0 . 1ZSDS at room temperature, and then washed twice in 
4xSSC/0. 1ZSDS at 65°C for 45 min. Stepwise reductions of salt 
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concentration (down to O.lxSSC) were used for additional washes 

when necessary. 

DNA Sequence Analysis 

The sequence of the Hin dlll fragment covering the rat IL-3 

28 

gene was determined by the chain termination method of Sanger , 
using a universal flanking primer^. The DNA was prepared for 
sequencing by sonication of self-ligated fragment, T4 DNA 
polymerase repair and electrophoresis in low melting point 
agarose 30 . Fragments in size ranges 300-500 bp, 500-1000bp and 
1000-2000bp were isolated from the gel and cloned into the Sma l 
site of M13mpl0 31 . Competent cells of E.coli strain TGI, 
prepared by the method of Hanahan , were used for the trans- 
fection of the recombinant M13 DNA. Gels were routinely 5% 
acrylaraide with "wedged" bottoms, and were dried to increase 
resolution^. Compressions were checked using gels containing 
25% f ormamide 33 . DNA sequence data was entered directly from 
au torad iograms into the computer using a digitizer, and assembly 
and analysis of this data was performed using the computer pro- 
grams of S taden 3 * -3 ** . The Genbank database was searched using 

37 

the progam of Wilbur and Lipman 
Expression of Rat In t er 1 euk in-3 

C0S-1 cells were seeded at 5x10"* cells per 60mm diameter 
petri dish, grown overnight in Dulbecco's modified Eagle's medium 
(DMEM) supplemented with 10% foetal calf serum (FCS), then trans- 
fected with the IL-3 expression vectors. Aug of plasraid DNA plus 
16ug of carrier DNA was transfected per 10 cells by calcium 
phosphate precipitation and glycerol shock (15% glycerol for 
three minutes) 3 ®* 3 ^. The cells were washed in DMEM containing 
10% FCS, and incubated in 5ml of the same medium at 37 C in 5% 
CO2. Supernatants were collected at 72 and 144 hours post- 
transfection and filtered through Amicon 0.2pm filters. Samples 
were stored at 4°C prior to assay. 
Assays for IL-3 

1. Colony forming assays. Colony assays were done using bone 
marrow cells from WISTAR rats or BALB/c mice, scoring colonies on 
day 7 of culture 3 ^. The entire culture was fixed with 2.5% 
glutaraldehy de then stained with Lutol Fast B lue-hema t ox y 1 in to 
identify granulocyte-macrophage, erythroid, mixed erythroid and 
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megakar y ocy tic colonies. Each culture contained 75,000 bone 
marrow cells in 1.5 ml. 

2. Bone marrow cell proliferation assays. Serial two-fold di- 
lutions of test material were made in microtitre plates in 50ul 
RPMI 1640, 10% FCS. 5xl0 A WISTAR rat or BALB/c mouse bone marrow 
cells were added to each well. Bone marrow cells from PVG, DA, 
PVG x DA, JC, PVG x JC and Fischer rats were also tested in this 
assay. Cultures were pulsed overnight on day 3 with [ H]— 
thymidine. Activity is expressed as a reciprocal of the titra- 
tion endpoint, the endpoint being defined as the supernatant 
dilution at which activity is no longer detectable^. 

RESULTS 

Southern Hybridization of Rat Genomic DNA 

Rat genomic DNA was digested with the restriction enzymes 
Eco RI and Hin di 1 1 , electrophoresed on an agarose gel, blotted to 
nitrocellulose and hybridized with a murine 1L-3 cDNA probe. A 
single hybridizing fragment of appropriate intensity for a 
single-copy gene was observed in the Hin d I II digest at approx. 
5.8kb (Fig.l). The Eco RI digest gave two bands, suggesting the 
presence of an Eco RI site within the rat IL-3 gene. The reason 
for the different intensity of the two Eco RI bands is not clear 
but may be due to a greater representation of sequences derived 
from particular regions of the template fragment in the random 
primer probe. 

Screening of the Rat Genomic Library 

A library of approx. 10° recombinant phage was generated by 
cloning 9-20 kb fragments from a partial Sau 3A digest of total 
genomic Rat DMA into AEMBL3A arms created by Bam HI/ Eco RI diges- 
tion. The complete library was plated out onto 4 baking dishes 
(approx. 250,000 recombinant phage per plate) and duplicate lifts 
were taken from each dish. A total of 5 plaques were identified 
by hybridization to the 467 bp Hin dlll- Nco l fragment of murine 
IL-3 cDNA labelled with ^2p by randomly primed synthesis. These 
5 clones were purified and small scale XDNA preparations were 
carried out. Digestion of the DNA with Eco RI showed the five 
clones to be overlapping, each containing one or both of the two 
expected Eco RI fragments (Fig. 2). Clone AR3, which contained 
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genomic DNA AR3 DNA 

H E H E H/E 




Figure 1 Southern blot analysis of rat genomic DNA and DNA from 
the genomic clone AR3, probed with an [a- P], dATP - labelled 
murine IL-3 cDNA fragment derived from pILM3°. Gel electro- 
phoresis, blotting* probe preparation and hybridization 
conditions are given in Materials and Methods. Restriction endo- 
nuclease abbreviations are E, Eco RI and H, Hin d 1 1 1 . Sizes given 
are in kilobase pairs (kb). 



both fragments, was grown in large scale for further characteri- 
zation. Southern hybridization of enzyme digests of XR3 DNA 
shows that it contains the fragments originally seen in the 
genomic DNA digests (Fig. 1) and allows localization of the rat 
IL-3 gene to a pair of fragments (3„3kb and 1.25kb) from a 
Bam HI/ Hin dlll double digest (Fig. 2). 
Nucleotide Sequence of the Rat IL-3 Gene 

The nucleotide sequence of the 5.8 Kb Hin d 1 1 1 fragment 
encompassing the rat IL-3 gene was determined on both strands by 
the chain termination method. The sequence of this region 
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Figure 2 Restriction map of the rat IL-3 gene showing the 
regions covered by the 5 overlapping clones isolated from a 
XEMBL3A library of rat genomic DNA. The map of the gene was 
deduced from the characterization of purified XR3 DNA. Proposed 
exons are indicated by solid boxes* and cover the region from the 
AUG codon believed to be the start site of translation through to 
the translational termination codon UAA. Restriction endo- 
nuclease abbreviations are E t Eco RI and H p Hin di II . 

(excluding the 5' 1.25kb Hin d I I I / Bam Hl fragment) is presented, 
together with an alignment against the murine IL-3 gene sequence, 
in Fig. 3. The overall homology between the two nucleotide 
sequences in the coding regions is 76% ; the introns and flanking 
regions (approx. 200 nucleotides either side of the gene) are 
more highly conserved t with 80% and 90% homology respectively. 
Structure of the Rat IL-3 Gene 

Sufficient conservation of gene structure was present to 
allow the structure of the rat IL-3 gene to be deduced by align- 
ment with the gene for murine IL-3 (Fig. 3). The rat IL-3 gene 
consists of five exons interrupted by four introns with conserva- 
tion of the intron/exon boundaries between the rat and mouse 
genes. The deduction of the gene structure is greatly assisted 
by the high sequence homology through these regions. All introns 
interrupt the coding sequence between codons. 

Sequence homology in the region corresponding to the leader 
sequence and extending 400 nucleotides upstream of the transla- 
tional start site in murine IL-3 is extremely high. It is there- 
fore likely that translation of rat IL-3 commences at nucleotide 
position 1374, making the first ten amino acids identical to 
those in the leader portion of murine IL-3. Transcriptional 
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(R) 10 2" 30 40 50 60 70 80 

(^TCCAGCAGACCAGTCll X TI'XGIACC ^ 

100 110 120 130 140 ISO 160 170 

CCCCC^CCCCClXXJ'lVnxX^CCCCT MVm^ 

190 200 210 220 230 240 2S0 260 

CTTCCTAcecrrrcM'n i i'i u;!!^^ 

280 290 300 310 320 330 • 340 350 

CCCCACGCTTCrcACACrC A l inV JJlCC^ 

370 380 390 400 410 420 430 440 

CCTTCAGCCaMrarcCCCTCGCrc 

460 470 480 490 500 510 520 530 

TAA LlUltALlWTllL ' niJlU ^CTCT-GrAO^^ 

550 560 570 580 590 600 610 620 

OATCCCTCAATAJOG CllUtXTlliA OOGAGT^ 

640 650 660 670 680 690 700 710 

CTXX^X:AGCrCCCACrrAACTAACAlT ^ ^ 

730 740 750 760 770 780 790 800 

GGCCTA l/lvri - lt^ CTCATATCTCCATCAA U^T l V 

820 830 840 850 860 870 880 890 

AGTCAAGGACTIGAGCTOUtCTC^ 

910 920 930 940 950 960 970 980 

AGCOGCCAGACraurrAGCCCACCCJJ^ rilA-LllA CTCAClLlU^T CC CC 

1000 1010 1020 1030 1040 1050 1060 1070 

CACTAAIAGTOg roG CE CA CAtn^^ 

i 1 1 1 ■ 1 1 1 1 it nut ttiiiffititisti tti ttiit i in inn iiiiiiiiiiin 
(M) CMZrCMSGGICM UmVLW 

1090 1100 1110 1120 1130 1140 1150 1160 

(HtfXXACCCTACTTCCICa^CCA ^^ 

::::ti:n nrt:mni:i iiiiiiiut im iiiiiiiiiittiiiiiititiiiititiiniii tin tinii t mi i i 
GAGCXAOCXrrA L - ri\X ' lXJC CAaU M-L lUlTll CACTCCgrCCAlCTCTJraaCAAA^^ 1 

1180 1190 1200 1210 1220 1230 1240 1250 

TCCCArciCGATAATTTTCy^ 

ttiiiti :tt:t::i :::i::iiiinii itiiititiit iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiim iii i nit 
TCCCATCTCX^TAA lVIUtiA GTACTACAAACTCAICAA^ 

1270 12B0 1290 1300 1310 1320 1330 1340 

(axrroCC X T llCLX KCATtgC lTl^L ^ ^^ 

tillli:>: tun tit it nilttll ttltitli nun mi i in I in tt 1 1 i in 1 1 1 i it i i 1 1 i 1 1 

«xiTCCCTcc(xxx--raxccrex3X^ 

1410 1420 1430 

ri cror A TC x ntJcnvxixxiv^uAiC L it.ri u- 
tt t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ i ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i iniiifiit 1 1 1 1 1 1 1 1 1 1 1 1 tit 1 1 

ACCCCTTOyUXaWXACaiACGACACX*^ 



L-exon 



1450 1460 1470 1480 1490 1500 1510 1520 

ACCACOGACTCCAflATrrCACACAI A^ 

lit in i i it i tit iii tit nun titt in n un 1 1 1 1 1 1 i mm t iiiiin i i 

ACCTCXaSACrCCAAlXTITCAATC 



imiiiii itimtimi ttttti nun ttit ttitiitttittt nt i tn iii iiii iuiiii 



1540 T 1550 1560 1570 1580 1590 1600 1610 

TGAAOrrCCCAGTGJCGTAGCTGGCT^ 

i 1 1 it : : : :i i it) I : 
CCAAGCTCCCACTCACT, 

•^H- ex on H 

1620 1630 r 1^40 1650 1660 1670 **| 1 690 1 700 

TTTTAGTATCCT- - CAJGGTATCITXiACTCAATAATAGTCAC^ 

iti : : : t : tin i ill ttttti i t tti tt t it tiiiiittilltitiitii II t i i tit til l t I I 
TTTCTAltTItZCrCACJtfSGiJiCCTTjAACT 

*-exon 2-^ -♦ox on 2 J 

1710 1720 1730 1740 1750 1760 1770 178 0 1790 

TCCATCTCTCIX:CTCXXTCXXrrGACTTrA^ 

tiiitii tiiiiiiiittiiiiim t m I I till til iii t I I I I 1 t I ill! i i i i i i i i i t t i i i i i i i i i i t t ■ i 
TCCATCTATCTCCTCXrCTGCCTGACTTCAGCCAT - -OlCTCCACCATCCL'l , I\A-l"l , LCAl 1 1 1UI*CATCTATCTCACTCGCGTTATTAAC 

1800 1810 1820 1830 1840 1850 1860 1870 1880 

GAAATCATCAGATGAC1L lVltiACXrCTCA^ 

tin it i i : t i i tin : i it timtiii i i ti in iiiiiiiiiiiniiiiiii i iiiiitu tttiisii lit it 
GAAATCATCAGATGACTCTCA - AGCCTCAGTATATACCTCAGTC AGCTCX^ATAAI^ 

1890 1900 1910 1920 1930 1940 1950 1960 1970 

AACTCTGAGTGAACCCTGCCA CAKXX^CTOGCTCCACTTTCACTOGOC»TTKX^ 

itttt iiitittitiM I I I I I 1 1 it I I I I i i i m t ii i i I I 1 1 1 I i I I I I tit 
- - CTCTGCGTGAACCCTGCCTGCTGC AOCCCTCTCCCTTC TCC 
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I960 1990 2000 2010 2020 2030 2040 2050 

i tttitttti ti t i i ti iti n I I I I t ii I i i n i I ni ti i ti t t i i i i i ii : : 1 1 t t i i : j i i ttittttti in 

ccnrrgrcA Liini,iwiwTiTu^ 

2060 2070 2080 2090 2100 2110 2120 

IXXlACTT MTriV CCAGCTCTACITXGATAGAATC CAAGCGTGC 

i: Mtittititi itttt titttitt til iii t i 1 1 i i i t ■ i i 1 1 1 1 ■ iiti till 

TGTACTTATTTTCC TAC.I'l'lVjAIACAATAGTTXi'U 11 ' 1TU1TXT1 A1T11 lUll^ ACCTAACATCAA U ' llVl ' lUtX AAGACTCC 

2130 2140 2150 2160 2170 21B0 2190 2200 2210 

TCACTCTCTACCTTrCCCCTCCCTACOGACATCATOAAGTCTCT 

it: itttti 1 1 1 1 1 i : i : 1 1 1 1 i t itttt ti tt it ttiititititiitii nit tttt ititi:tttittttitiitt 
TGAAICTCTATA T ITUA X TUX^^^ ^ 

2220 2230 2240 2250 2260 2270 2280 2290 2300 

TTGGGCTTGGGGATTTAGCTCAGTGGTA^ 

Mil 

TTGG 

2310 2320 2330 2340 2350 2360 2370 2380 2390 

AGAAAAAGAAAAAAAATTAAAGCATTA MXU ' lt^ 

tit 1 1 1 ::in:tti:::: itttt iiiiiiiitii:::tiitiiititini 

ccrcGGTTTrrcccAxcrroc T MrmcTix^ xxAAC LTivit^xi ' iriviA CA 

2400 2410 2420 2430 2440 2450 2460 2470 2*80 

ttrmTTCTGC^aUOTATCTTTCCCTTJU^^ 

t i : : mini ttt ttitt ttt tt ttttitt miiiiiii :i itititttttt tinitniitttt tnititint 
GCTTTrCTtMAGCGATCTXnTTTCCTCA^ 

2490 2500 2510 2520 2530 2540 2550 2560 2570 

tint inntttttinitttttti ttt t t ittttttt : : t l t t i i t : 1 1 1 i ttti ttttt ttttttiittiititi 
Ult^Tl t AX ^GAAACAOCXrC LTlV^ 

2580 2590 2600 2610 2620 2630 2640 2650 2660 

AACCTCTAACACCACTCACACCTCCCCACCTCTCACCT^^ 

tl ttt It till! ttttt lilt tit ttt t lit illtllllll I I I t 1 t t 

AATCTCAAATAGCAGGCACAL 1T LL L lAGCTClGMXnXTrcCAGCTCTCAlllllCAA C 

2670 2680 2690 2700 2710 2720 2730 2740 

rcrcACCTCTCCMxrrcTCACTrra 

titiinii tiittttt i t 1 1 1 t I I 1 1 1 1 i t t i i t 1 1 I 1 1 I ■ t 1 1 i i I I ttitt ttttt tt::ill 
TCTCACCTCCCCAG llUlTllCmi CA ^ ^ ^ ^ 

2890 29O0 2910 2920 2930 2940 2950 

(approx. 140 nucL) tccacckxttcatctctacaactctcacctccc^^ 



2960 2970 2980 2990 3000 3010 3020 3030 3040 

TACCTCCCTA GTlVltJ ^rrCCTCTGA^ 



3050 3060 3070 3080 3090 3100 3110 3120 3130 

CACCTCCCAAGCTCTCACCTCCCC^^ 

: : : i : 1 1 t : i i i t i tittttttttttt 
CTCCCAACCTCTTTCATTTCCACTTCTCACCT 

3140 3150 3160 3170 3180 3190 3200 r 3210 3220 

TTTCCA Lri'lt-'lT ACCACCXraU^^ 

iittitiitiittiiitt i i 1 1 i t tttttii 1 1 t ■ i 1 1 1 t 1 1 1 1 1 t i i i ix i t i i i 1 1 t t 1 1 
CCCAGCTCTCACATCCCTGGGCCTTAC^ 

Lexon 3-4*- 

3230 3240 3250 3260 3270 3280 3290 T 3310 

C CTGGACGAATTCCTAAAAACCCAAGACGACTTTGATTCTCACGACACA^ 

iis: : ttttt i miiiiiii ii t tt: tt tititi t i tttttittiii t i i i t i : i t t t ; i i nn it 
CCTCTCCAAATTCGTGGAAAGCCAAGGAGAAG^^ 

-*-exon 3 J 

3320 3330 3340 3350 3360 3370 3380 3390 

GGGCTTAGGGATCCCAC CTCC AC L mi CCACC l^lTriXX atf^OCaUlACT^ 

ttt t nt tttt in intiini iniiitm tniinii it ttttt: i it tiitttt 

CXXnTGGGGGTCCGGGCTGGGGCTCC Ol^ 



3400 3410 r 3430 3440 3450 — I 3470 3480 

CTCACTOTCCLUlWlXflVlintXJiau^ 

i 1 1 it i i in in t :n : i 1 1 n i i 1 1 : : : t t i t t iti ii intit inittt iimiimi ti in i i 

CTCACTAICC LTl\Xflt; TClX^C C ACACAAACTTAAL ll/lUV^L'llCCTAiZATCTCCGAAIT^^ CCCITCCC 

Lexon 4 -*-exon 4 J 



3490 3500 3510 3520 3530 3540 3550 3560 

T CTGTCAAGGTTA TCXCCTTGCTCATCJACCAGACTTAAAGCCTGCTATA^ 

tin tt ttiititit : t i it i i : t i i i i i it nit i im tint t t uii::n iinit: ttt 

CCA lUWlTl 'ArcACXXrrCTOCLVITl ^ ^ 
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I" 3590 3600 3610 3620 3630 3640 3650 



3570 
TTTTGGTTCGTt 

it tttiu inttt tititn i tn tt i 1 1 1 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r t 1 1 ti ittt 1 1 1 1 1 tt 

Lexon 5-^- 



3660 3670 3680 3690 3700 3 710 3720 3730 3740 

CCTGCAGCCAGTGTCAGTCTCTAGACCACCTCAGrc 

III II Mill i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • I lllll till ti t it i 1 1 1 1 I 1 I I I I I I I t I 1 I I I I I I I 

TCTCxyuiJwiMrrcxrrAAccTCTA^ 

3750 3760 3770 37B0 3790 3800 3810 3820 3830 

(X^CA(KAACT(»A tXtATJJ^VlT^ I ' lA CCAATTTAT 
tnitlii tt tt itttttttitttttitttitt t it I 1 1 1 1 1 i ttittii niitiiitt: ttiitiit 

3840 3850 3860 3870 3880 3890 3900 

CTACTTTCTGAAATCTGCTAA TTACCTAfclTATCCCCCTAlTTTAITCTArTAACCCrArrrAI CTATTTATCT 

iii ittt 1 1 1 1 1 1 1 i t t 1 1 1 1 1 1 1 1 1 1 1 • t titiiiiiiitt 1 1 1 1 1 1 1 1 1 1 t r 1 1 1 ittitititi 

CTA LllllUjJ UUaTTACAA lUrilCm 

3920 3930 3940 3950 3960 3970 3980 3990 

ATTTATTTATTTATTACCTTCTCXIAATGTGAACTA^ - TTCOCAGA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ittiiti linn iittii 1 1 1 1 ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 iiiiiii miiii ttii it 

ATTTATTTATTTATTCCLUT.VlVltiATCTCA-GTATAl'L 11, 1 1 1 1 A^XTK^CCACGAl/lTlVItJC AAACAAAATTCCAAGC 

4000 4010 4020 4030 4040 4050 4060 4070 4080 

(aGACTPCQOOOCCTCTrC A™ 

Iltllttlt lllittlltllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III ttttl 

AAGACTCO0CCCA-TCTTCATTTGTCCCT TCTrCGAAATAAATXACTTTCAACAAATOGCTTTG 

4090 4100 4110 4120 4130 4140 4150 4160 4170 

GTCTTCTCCACTTTCTr A OTmAjC ATCAlCTCIOI A J^ ^ 

•J > ititi mi iii iiiiiii t t i i 1 1 i 1 1 1 1 1 1 1 1 1 ■ i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t: t :t:i ititiittitit 

41B0 4190 4200 4210 4220 4230 4240 4250 4260 

TACAMaTGraCCAGMC MWl C lt*^ 
t tttiittii 
TGCAACAITTTAT ■ 

4270 4280 4290 4300 4310 4320 4330 4340 4350 

ATtXXaCCMTCAACTCCAGTCr UG CAAOGI^ 

4360 4370 4380 4390 4400 4410 4420 4430 4440 

GC CCCTGGAACATGCTACTCACTACTGAAAGTCATrAGAATCXTTC CTCTAACACCT 

4450 4460 4470 4480 4490 4500 4510 4520 4530 

CCCATACTAGAAGCTGTTCACTACCATTCT 

4540 4550 4560 

xvritJU ^CAA i>iTivu a^cai x t:r A AOCTT 

Figure 3 Nucleotide sequence of the rat (R) IL-3 gene covering 
the 3.3 Kb Baro HI and 1.25 Kb Bam H I /Rind I I I fragments, shown 
aligned against the sequence of the murine (M) IL-3 gene. TATA 
box and pol y-adeny la t ion signals are underlined. Intron/exon 
boundaries are shown as deduced for the murine gene • . 

control ' signals (including the potential enhancer hexanuc leo tide 
CCGCCC) described by Campbell et al 8 and Miyatake et al 9 are 
identical, although the proposed CAAT sequence would be a further 
7 nucleotides upstream of the TATA box in the rat gene than is 
the case in murine IL-3. 

Nucleotide sequence homology of the 3* end of the coding 
region is once again high, and there is an in-frame t ranslational 
termination codon at position 3733. A single AATAAA sequence is 
found in the 3 f untranslated region, 25 nucleotides nearer the 
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TAA stop codon than in murine IL-3. This sequence is likely to 

a i a o 

be involved in polyadeny la t ion of the mRNA ai, ^ z . 

Other conserved features include two sets of tanderaly 
repeated sequences. The first, a 14-15 bp element repeated about 
12 times in intron 2 of murine IL-3,. is also found in intron 2 of 
rat IL-3 t except that in the latter case the repeat appears to 
extend for a further 400-450 nucleotides ie 25-30 more repeating 
units than in the murine gene. The complete sequence through 
these repeating units was not determined due to the difficulty in 
obtaining clones through this region. The amount of "missing" 
sequence was deduced by visual estimation from sequencing gel 
au toradiograms together with careful estimation of restriction 
fragment lengths. The consensus sequence of these repeats. in the 
rat gene, AGCTCTCACCTCCC , is identical to the consensus sequence 

Q 

of the murine IL-3 intron 2 repeats . The 9-bp inverted repeat 
which flanks these tandem repeats in the mouse IL-3 gene is not 
conserved in the rat gene. The second repetitive element occurs 
between the termination codon and the polyadeny lation signal 
(nucleotides 3872-3922), and consists of nine copies of the 
sequence TATT 8 . 

One further feature of interest is the presence of sequences 

in the 5' untranslated region which show homology to murine Bl, 

B2 and new Alu type repeating elements (data not shown). These 

elements represent major classes of short repeats dispersed 

throughout the mouse genome an( j ma y eacn be present in up 

to 50,000 copies per haploid genome. Sequences homologous to 

rat poly (RY) repeats^ are also found in this 5' region. The 

function of these sequences is unknown; there are suggestions 

ranging from a possible role in regulation of transcr iption/mRNA 

transport/translation^"^ to the notion of "junk DNA" or pseudo- 
49 

genes 

Amino Acid Sequence of Rat IL-3 

The deduced amino acid sequence of rat IL-3 is shown in 
Fig. 4 compared to murine IL-3. Overall homology is 59% which 
reduces to 54% in the "mature" protein ie. after the removal of 
the putative leader sequence^'^ , which is the most highly 
conserved part of the protein. Rat IL-3 has two potential 
N-glycosylat ion sites : the second of these corresponds to one of 
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homology 

(R) MVLAS3TT3 IUMXPLLKLFHOGLQ I SOW3IW«XR3TJXOT LXLEI LVK1P 

AXOn 1 A********* AAA AAA*** *AA A A A A A AAA A A AA AAA 67% 

(M) HVLA33TT3 IHTMLLLLLKLFHLGLQA3 1 SCRDTHRLTRTTJIC3S IVXEI IGKLP 



(R) V3GLKH3IMCAHLft 

•xon 2 a" a aa 29% 

(M) EPELKTDCEGPSLR 



(R) MSTLRJn/HLEEXTJCSQEQIBQfiTTDIESltLQ, 
axon 3 * aaaaa a aa a a a aaa aa 53% 

(M) tpCSFRStVgLSKFVESQGCVDPEDRTVXKSNLO 



(R) KLXCCIPAAAS08V 

• XOH 4 AA AA A A A* 57% 

(M) KLNCCLPTSAKDSA 



00 i*camtxrLuariaaaxFYvimnx^^ 

SXOn 6 AAA A AAAAA AAAAAA A A A A * A AAAAA A A AAAA 63% 

(M) LFCVFIRIXJDCFRKXIJlFY>fVML*D 



overs B homology 59% 

Figure 4 Comparison of predicted coding sequences of the exons 
of rat (R) and murine (M) IL-3 genes. Potential N-glycosy lation 
sites are underlined* The arrow indicates the most likely site of 
cleavage of the signal peptide 

the 4 potential sites in murine IL-3. All four cysteine residues 
are conserved. 

Expression of Rat IL-3 in Monkey COS-1 Cells 

The 5.8kb Hin d II I fragment carrying the entire rat IL-3 gene 
(including transcriptional/translational control signals) was 
cloned into the Hin dll I site of the expression vector pSV2— neo » 
Two clones were isolated, one for each possible orientation of 
the IL-3 gene relative to the SV40 promoter: pILRl contained the 
IL-3 insert in the same orientation as the SV40 promoter and 
pILR2 had the insert in the reverse direction. Purified plasmid 
DNA from both constructions was used to transfect monkey COS-1 
cells. Supernatants from the transfected cells were collected at 
72 and 144 hours, based on previous experience with this 
expression system for murine IL-3. Murine IL-3 was prepared at 
the same time, using pS V2- neo vectors containing either genomic 
(pILM13) or cDNA (pILM4) murine IL-3 inserts. Supernatants from 
untransf ected COS-1 cells were used as controls. 
Colony Stimulating Activity of Rat IL-3 

Recombinant rat IL-3 from both pILRl and pILR2 stimulated 
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Table 1 : Bone Marrow 


Proliferation 


Activity of Rat 


and 


Murine IL— 


Growth. Factor 


Source 


Bone Marrow Proliferation^ 3 ^ 
(Units/ml ) 
Mouse Rat 


Mouse 11-3 


WEHI-3 CM 


1025 




<10 


Recombinant 
mouse IL-3 


COS-1 


. 250 
360 
90 




<10 
<10 
<10 


Recombinant 
rat IL-3 


COS-1 


<10 
10 
50 




725 
510 
630 


Mouse GM-CSF 


purif ied* b ) 


3100 




210 



(a) Proliferation was measured as described in Methods using 
bone marrow cells from either Balb/c mice or WISTAR rats. 
Activity is expressed as the reciprocal of the titration end- 
point. COS-1 conditioned medium showed no detectable activity. 

(b) Purified murine GM-CSF was obtained from Genzyme (Boston, 
Mass.) Lot No. 01531 and was verified to be GM-CSF by its ability 
to stimulate growth of FDC-P1 but not 32Dcl-23 cells. 

colony formation only with rat bone marrow cells, while expressed 
murine IL-3 stimulated colony formation only with mouse cells. 
Bacterially expressed murine IL-3 was also inactive on the rat 
bone marrow cells. 

In these experiments, rat IL-3 produced the full range of 
colony types expected for this factor on rat bone marrow cells, 
although there was a much greater proportion of megakaryocytes 
and eosinophils amongst the colonies generated than in the 
corresponding assays of murine IL-3 on murine cells. (A similar 
elevated response of megakaryocytes and eosinophils has been 
observed when assaying bacterially expressed murine IL-3). The 
biological properties of recombinant rat IL-3 will be reported in 
detail elsewhere (A.J.Hapel, D.R.Cohen and I.G.Young, In prepara- 
tion). 

Proliferation of Rat and Mouse Bone Marrow Cells in IL-3 

Bone marrow cells are generally as sensitive as FDC-P1 cells 
for detecting IL— 3 and GM— CSF activities in the mouse when used 
as at 10 J cells per well in the standard raicrotitre cell proli- 
feration assay. As there are no established cell line assays for 
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rat IL-3 we used bone marrow cells from a variety of rats and 
BALB/c mice to measure the growth factor activity of expressed 
rat and mouse IL-3, 

The data in Table 1 show that rat IL-3 had good activity in 
promoting the proliferation of WISTAR rat bone marrow cells and 
slight activity on murine cells, while expressed murine IL-3 was 
active only on cells from murine bone marrow. This is in con- 
trast to murine GM-CSF which has significant activity in the rat 
system. A similar result has been obtained using bone marrow 
cells from other strains of rats, except that PVG, PVG x JC and 
PVG x DA rats gave end-points about 4-8 fold greater than WISTAR, 
Fischer and JC rats. More extensive characterization of the 
biological relationship between rat and murine IL-3 will be 
described elsewhere (A.J.Hapel, D.R.Cohen and I.G.Young, In pre- 
paration ) . 



DISCUSSION 

Previous work on the biology and molecular biology of IL-3 
has been predominantly with the mouse system. It is clear from 
the work described above that the rat genome carries an IL-3 gene 
which is closely related in overall structure to that found in 
the mouse genome. The cloning and sequence analysis of the rat 
IL-3 gene has enabled a detailed comparison to be made between 
the rat and mouse genes. The flanking regions (200 nucleotides 
either side of the gene) and introns of the mouse and rat IL-3 
genes show 90% and 80% nucleotide homology, respectively, with 
the intron/exon junctions fully conserved. The portion of exon 

1 encoding the putative signal peptide is also highly conserved 
( 90%). Surprisingly, the nucleotide homology between the coding 
regions is only 76%. This^ is reflected in the amino acid 
homology which is on ly , 54% for mature ra.t aod mouse IL-3 (59% 
including the signal peptide). This is atypical of other lyrapho- 
kines characterized to date. For example, y -IFN shows an amino 
acid homology of 87% between mouse and rat , while IL-2 and GM- 
CSF show mouse : human amino acid homologies of 60% and 54% 
respectively * ^* The very high conservation of the signal 

peptide region observed between rat and mouse IL-3 is also not 
typical of the other lymphokines 15 * 16 ' 53 • 5 * . 
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The conservation of the two sets of tandomly repeated units 
is also of interest. The TATT sequence, repeated nine times 
between the translational termination codon and the poly-adenyla- 
tion signal in both murine and rat IL-3, is also found in a 
similar position in all IL-2, GM-CSF and Y-IFN genes sequenced to 
date 15 " 17,53 " 58 . The consensus sequence of the other repeat, a 
14-15 bp element in intron 2, is identical to that of the tandem 
repeats in intron 2 of mouse IL-3. This element shows homology 
to a human BKV enhancer sequence homolog ' , and also to a 
portion of the consensus sequence for a 33-bp human myoglobin 
gene tandem repeat 0 ' 0 . Similar repeats are found in the human 
insulin gene and globin gene complex 0 ^ and are associated with 
"hy pervar iable regions" of DNA, 

Expression experiments using monkey COS-1 cells have 
demonstrated that the rat IL-3 gene encodes a multi-lineage 
haemopoietic growth regulator, which appears to have an analogous 
biological role to mouse IL-3 63 . The low amino acid homology 
between rat and mouse IL-3 correlates with the demonstration that 
rat and mouse IL-3 show little cross-reactivity. Other lympho- 
kines and CSF's are somewhat varied in this regard. For example, 
human IL-2 is about 5 times more active on mouse cells than is 
murine IL-2 on human cells 15 , whereas human and murine GM-CSF's 
do not cross-react either way 16 . Rat Y-IFN is active on mouse 
cells but not on human cells 53 , and human M-CSF can stimulate 
growth of murine macrophages 6 ^. The low sequence homology and 
poor cross-reactivity suggest that rat IL-3, together with its 
receptor, have evolved significantly away from the murine 
IL-3/recep tor system. 

The availability of the rat IL— 3 gene together with recom- 
binant rat IL-3 should assist in studying the biological role of 
IL-3 in rat experimental models such as adjuvant-induced 

arthritis 65 , graft rejection 66 and megakaryocyte prolif- 
67 68 

eration » . The rat gene sequence also provides useful 

information for continuing studies on the regulatory elements 
controlling IL-3 expression. 
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1 A cDNA sequence coding for rat interleukin-lo (IL-la) has been isolated from a cDNA 
library that was prepared with mRNA derived from LPS -stimulated rat peritoneal 
macrophages by using human IL-la cDNA as a probe. The rat cDNA encodes a 270 amino 
acid residue protein which is homologous (65%) to human IL-la. The rat cDNA sequence 
under SV40 early promoter directed the synthesis of biologically active IL-1 in monkey 
COS-1 cells. Rat IL-la mRNA is not expressed in spleen, lung, liver or brain, and is also not 
expressed in these organs of LPS- treated rat except spleen. This suggests that IL-la is not 
produced constitutively in various tissues and LPS is not sufficient to induce IL-la in most 
tissues. Our data indicate that the IL-1 activities which have been reported to be produced 
in the brain are not of a type. We have constructed a plasmid expressing the carboxy 
terminal 156 amino acids in Escherichia colu Recombinant rat IL-la produced in COS cells 
or E. coli has cytotoxic activity against the human melanoma cell line A375S1 (GIF 
activity), which has been reported to be sensitive to human IL-la and IL-1£. This suggests 
that GIF activity is common to IL-ls derived from various sources. 



Interleukin- 1 (IL-1) is a cytokine released from activated 
macrophages and numerous other cell types. It is involved 
in immunologic and inflammatory responses, including the 
stimulation of lymphocytes proliferation, lymphokine 
production, prostaglandin production, and the synthesis of 
acute phase proteins (1). It has become evident that IL-1 
activities are encoded by two distinct cDNA sequences in 
human (2-4) and these cDNAs have been termed a and /?, 
respectively. These cDNAs have been expressed in E. coli, 
and large amounts of both recombinant IL-ls have been 
purified by several groups (5-22). We have also described 
the isolation of both cDNAs from a cDNA library that was 
prepared with mRNA from human histiocytic lymphoma 
cell line U937 (12) and the purification of recombinant 
IL-l<r and p expressed in E, coli (13, 14). The availability 
of recombinant IL-1 has permitted confirmation of the 
multiple biological activities of IL-1 (14, 15). Recently 
recombinant human IL-1 has been reported to stimulate 
the proliferation of astrocytes (26), the secretion of 
adrenocorticotropic hormone (ACTH) and Cortisol in mice 
(27), the secretion of thyroid -stimulating hormone in rat 
pituitary cells (28), and the secretion of corticotropin - 
releasing factor in rats (19, 20). Thus, these reports 
indicate that IL-1 is also active in the regulation of the 
endocrine and central nervous systems. An animal model is 
required to elucidate the roles of IL-1 in vivo. The adjuvant 
arthritis rat (22), the . glbmerulonephritic rat (22), the 
adrenalectomized rat (23), and the hypophysectomized rat 
(24), among others, are very useful in this regard. How- 
ever, the purification of rat IL-1, the preparation of 
antibody against rat IL-1, and the isolation of rat IL-1 
cDNA have not been reported. So we first tried to isolate rat 

Abbreviations: IL-1, interleukin- 1; LPS, lipopoly saccharide; PHA, 
phytohemagglutinin; FCS, fetal calf serum; PEC, peritoneal exudate 
cells; LAF, lymphocyte activating factor; GIF, growth inhibitory 
factor; ACTH, adrenocorticotropic hormone. 



IL-1 cDNA. In this paper, we describe the molecular 
cloning of rat IL-la cDNA, the expression of rat IL-1^ 
mRNA in some tissues, and the production of the mature 
form of rat IL-la in E. coli. 

MATERIALS AND METHODS 

Enzymes and Chemicals — Restriction endonucleases and 
T4 DNA ligase were purchased from Takara Shuzo Co., 
Ltd. Avian myeloblastosis virus reverse transcriptase was 
obtained from Bio-Rad Laboratories; calf thymus terminal 
deoxynucleotidyl transferase, E. coli DNA ligase and 
RNase H from Pharmacia; and E. coli DNA polymerase I 
from Boehringer Mannheim GmbH. All these enzymes 
were used according to the supplier's recommendations. 
[a- 32 P]dCTP and [ 3 H] thymidine were purchased from 
Amersham. E. coli lipopoly saccharide (LPS) and proteose 
peptone were purchased from Difco Laboratories, phyto- 
hemagglutinin (PHA) from Burroughs -Wellcome, in- 
domethacin from Kyowa Hakko Co., Ltd., oligo(dT) cellu- 
lose from Collaborative Research Inc., RPMI-1640 medium 
and fetal calf serum (FCS) from Gibco Laboratories, and 
Hanks 1 balanced salts from Flow Laboratories Inc. All of 
the other reagents used were analytical grade. 

Isolation of RNA— Rat peritoneal exudate, cells (PEC) 
were obtained according to the procedure described by 
Yamamoto and Tokunaga (25). Sprague-Dawley (SD) rats, 
purchased from Charles River Laboratories, were infused 
i.p. with 30 ml of 10% (w/v) proteose peptone. After 4 d, 
PEC were harvested by lavage with 20 ml of Hanks* 
balanced salts solution containing 20u/ml heparin. They 
were washed twice with this solution and then with 
RPMI-1640 medium. The washed cells were resuspended 
at 2xl0 6 cells/ml in RPMI-1640 medium supplemented 
with 10% FCS, lO^g/ml LPS and 0.1 ^g/ml indometh- 
acin. PEC suspension (10 ml/dish) was adhered to plastic 
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culture dishes (90 mm diam.). After 13.5 h, the adhered 
cells were used for RNA preparation. Total cellular RNA 
was isolated by the guanidine thiocyanate/cesium chloride 
centrifugation method (26). Sprague-Dawley rats were 
injected intravenously with 100/zg/kg of LPS. After 6h, 
total RNAs from tissues of rat were isolated by the 
guanidinium/hot phenol method (27) and then the 
guanidine thiocyanate/cesium chloride centrifugation 
method (26), Poly (A) + RNA was selected by oligo(dT)- 
cellulose column chromatography (28). 

Cloning of Rat IL-la cDNA— The oligo(dT)- tailed 
pcDVl plasmid primer and the oligo(dG) -tailed pLl linker 
fragment were prepared from plasmid pcDVl and pLl, 
respectively, according to the methods of Okayama and 
Berg (29, 30). The cDNA library was constructed by using 
the above pcDVl primer and pLl linker fragment with 
poly(A) + RNA according to the procedure of Okayama and 
Berg (30). The human IL-la cDNA insert (Ball-Mval 
fragment, 285 bp) from plasmid pcD-GIF-207 (12) was 
radiolabeled with [ar- 32 P]dCTP by nick translation using 
the kit from Amersham and was used to screen 68,000 
colonies of cDNA library. The hybridization solution was 
20% formamide, 50 mM sodium phosphate (pH6.5), 5X 
SSC (lxSSC = 0.15 M sodium chloride, 0.015 M sodium 
citrate), 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.2% bo- 
vine serum albumin, 0.1% SDS, and 100 //g/ml denatured 
salmon sperm DNA. After overnight hybridization at 
37°C, the niters were washed in 0.4 x SSC and 0.1% SDS at 
50°C. Two clones hybridized to the probe and were purified. 
The cDNA inserts to be analyzed were subcloned into 
plasmid pUC118 (31). A series of pUC118 subclones 
containing deleted cDNA inserts was prepared by the 
deletion kit for kilo- sequencing from Takara Shuzo Co., 
Ltd. based on the procedure of Henikoff (32) and Yanisch- 
Perron et al. (33) before sequencing by the dideoxynu- 
cleotides chain termination method (34). 

Northern Blot Analysis — Poly (A) + RNA (5//g) dena- 
tured with glyoxal was fractionated through 1.2% agarose 
gel and transferred to a nitrocellulose filter (35). The blot 
was hybridized with the radiolabeled rat IL-la cDNA 
probe (Ball-Sacl fragment, 1.6 kb) by nick translation. 
Hybridization was carried out at 42°C in 50% formamide, 
5 x SSC, 50 mM sodium phosphate (pH 6.5), sonicated 
denatured salmon sperm DNA (100 /*g/ml), and 0.08% each 
of bovine serum albumin, Ficoll and polyvinylpyrrolidone. 
The filter was washed at 50°C in 0.1 x SSC and 0.1% SDS. 

Trans fection into Monkey COS-1 Cells— The plasmid 
was transfected into monkey COS-1 cells as described 
previously (12). 

Construction of pcDE Plasmid— The modified Okayama- 
Berg expression plasmid, pcDE, was constructed from 
pcDVl and pLl. The pcDVl (30) was digested with Kpnl 
and treated with T4 DNA polymerase to convert the 
protruding 3' ends to blunt ends. The blunt ends of the DNA 
were attached with EcoRl linker and digested with EcoRI 
and Hindlll. The large fragment containing the poly- 
adenylation signal sequence from the late region of SV40 
DNA and pBR322 DNA segment was purified by agarose 
gel electrophoresis. The plasmid pLl (30) was digested 
with Pstl and both cohesive ends were converted to blunt 
ends with T4 DNA polymerase. The DNA was ligated with 
EcoBl linker and digested with EcoRl and tfmdIII. The 
small fragment, containing the SV40 early region promoter 



and two introns, was purified and ligated with the large 
fragment derived from the pcDVl plasmid. The resulting 
plasmid, pcDE, has the single EcoRl site between the SV40 
DNA segment containing early region promoter — two 
introns and the SV40 DNA segment containing the poly- 
adenylation signal sequence. 

Expression of Mature Rat IL-la in E. coli— The expres- 
sion plasmid ptrp-RT-IL-la was constructed as shown in 
Fig. 4. The purified Haelll-Accl fragment (530 bp) con- 
taining rat II- la cDNA was blunt- ended by the Klenow 
fragment of E. coli DNA polymerase I. It was inserted into 
the filled-in Hindlll site of plasmid pTMl (provided by Dr. 
Imamoto, unpublished data) containing the E. coli trp 
promoter -operator sequence to construct pTMl-RT-IL- 
ltf(P). The EcoRI-BamHl fragment containing E. coli trp 
promoter-operator and rat IL-la- cDNA was inserted 
between the EcoRl and BamEI sites of pUC119 (31) to 
construct ptrp-RT-IL-lar(P). The single -stranded ptrp- 
RT-IL-lo-(P) was prepared with the infection of the helper 
phage M13K07 (31) and was annealed to the primer (5') 
GGGTATCGATAATGTC AGCACCT(3') . The deletion of 
the extra cDNA and the insertion of the initiation codon 
ATG were carried out using the kit of the oligonucleotide - 
directed in vitro mutagenesis system from Amersham. The 
resulting double -stranded DNA was transformed into E. 
coli JM109 (33) to obtain ptrp-RT-IL-la'. E. coli JM109 
harbouring ptrp-RT-IL-la- was grown for 9 h in M9 
medium (36) supplemented with 1% casamino acids, 0.4% 
glucose, 5 //g/ml of thiamine-HCl, 20 /zg/ml of L-cysteine, 
and 50 //g/ml of ampicillin. The E. coli cells were collected 
and lysed as described previously (13). Biological activity 
was measured for the E. coli cell extracts. 

SDS-PAGE— SDS -PAGE was performed on 15% gel by 
the methods of Laemmli (37). The molecular weight 
standards used to calibrate the gel were: phosphorylase b, 
94kDa; bovine serum albumin, 67 kDa, ovalbumin, 43 
kDa; carbonic anhydrase, 30 kDa; soybean trypsin in- 
hibitor, 20.1 kDa; lar-lactalbumin, 14.4 kDa. The gel was 
fixed and stained with Coomassie Brilliant Blue. 

Biological Assays for Rat IL-la — The assay for the 
growth inhibitory factor (GIF) against human melanoma 
cell line A375S1 has been described previously (38). The 
lymphocyte activating factor (LAF) activity was deter- 
mined by measuring the incorporation of [ 3 H] thymidine 
into BALB/c mouse thymocytes in the presence of PHA 
according to the procedure described by Oppenheim et al. 
(39). One LAF unit is equal to half the value of maximum 
uptake of [ 3 H] thymidine with human recombinant IL-1/? 
(13)- 

RESULTS 

Isolation of cDNA Clones — Since activated macrophages 
produce IL-1, we prepared rat PEC elicited by proteose 
peptone. The LPS -stimulated adherent cells, a population 

Hindi HaelH Haelll * AccI EcoRl SacI 

5 - \ r- ' i 1 1 (A)n3 - 

100bp 

Fig. 1. Restriction map of rat IL-la cDNA (pcD-RT-IL-la-6). 

The coding region is shown as a box. 
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virtually of activated macrophages (25), were used as the 
source of mRNA. A cDNA library was constructed by using 
the Okayama-BergcDNA expression vector. The probe was 
prepared from the 5' end of human YL-\a cDNA {Ball- 
Mvaly 285 bp), whose nucleotide sequence is much more 
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homologous to that of the 5' end of murine IL-l<y cDNA 
{40), and was used for the screening of the cDNA library 
under the condition of reduced stringency. Two clones were 
found to hybridize to the probe and were then purified. The 
cDNA inserts of the two clones were about 2.1 kb in length 



AAAGTCTCCAGGGCACAGAGGGAGTCAACTCATTGGCGCTTGAGTCGGCAAAGAAATCAAG 6 1 



Fig. 2. Nucleotide sequence and 
deduced amino acid sequence for 
the cDNA insert of clone pcD-RT- 
IL- 1 a -6. Nucleotide numbers are 
shown along the right-hand side of the 
sequence and the deduced amino acid 
sequence and corresponding position 
numbers are shown beneath. The 
polyadenylation signal, AATAAA, is 
underlined. 



ATG GCC AAA GTT CCT GAC TTG TTT GAA GAC CTA AAG A AC TGC TAT AGT GAA AAT GAA GAA 121 
Met Ala Lys Val Pro Asp Leu Phe Glu Asp Leu Lys Asn Cys Tyr Ser Glu Asn Glu Glu 
I 10 20 

TAG AGT TCT GCC ATT GAC CAT CTG TCT CTG AAT CAG AAA TCC TTC TAT GAT GCA AGC TAT let 
Tyr Ser Ser Ala lie Asp His Leu Ser Leu Asn Gin Lys Ser Phe Tyr Asp Ala Ser Tyr 

30 40 

GGC TCA CTT CAT GAG AAC TGC ACA GAT AAA TTT GTA TCT CTG AGA ACC TCT GAA ACA TCA 241 
Gly Ser Leu His Glu Asn Cys Thr Asp Lys Phe Val Ser Leu Arg Thr Ser Glu Thr Ser 

50 60 

AAG ATG TCC ACC TTC ACC TTC AAG GAG AGC CGG GTG GTG GTG TCA GCA ACA TCA AAC AAA 301 
Lys Met Ser Thr Phe Thr Phe Lys Glu Ser Arg Val Val Val Ser Ala Thr Ser Asn Lys 

70 80 

GGG AAG ATT CTG AAG AAG AGA CGG CTA AGT TTC AAT CAG CCC TTT ACT GAA GAT GAC CTG 361 
Gly Lys lie Leu Lys Lys Arg Arg Leu Ser Phe Asn Gin Pro Phe Thr Glu Asp Asp Leu 

90 100 

GAG GCC ATA GCC CAT GAT TTA GAA GAG ACC ATC CAA CCC AGA TCA GCA CCT CAC AGC TTC 421 
Glu Ala lie Ala His Asp Leu Glu Glu Thr lie Gin Pro Arg Ser Ala Pro His Ser Phe 

110 120 

CAG AAT AAT TTG AGA TAC AAA TTG ATA AGG ATC GTC AAG CAG GAG TTC ATC ATG AAT GAT 481 
Gin Asn Asn Leu Arg Tyr Lys Leu lie Arg lie Val Lys Gin Glu Phe lie Uet Asn Asp 

130 |40 

TCC CTC AAC CAA AAT ATA TAT GTG GAT ATG GAC AGA ATA CAT CTC AAA GCT GCT TCG TTA 541 
Ser Leu Asn Gin Asn lie Tyr Val Asp Met Asp Arg lie His Leu Lys Ala Ala Ser Leu 

150 160 

AAT GAC CTG CAG CTT GAA GTA AAA TTT GAC ATG TAT GCC TAC TCA TCG GGA GGA GAC GAC 60) 
Asn Asp Leu Gin Leu Glu Val Lys Phe Asp Met Tyr Ala Tyr Ser Ser Gly Gly Asp Asp 

170 180 

TCT AAA TAT CCT GTG ACT CTC AAA GTC TCA AAT ACT CAG CTC TTT GTG AGT GCT CAG GGA 661 
Ser Lys Tyr Pro Val Thr Leu Lys Val Ser Asn Thr Gin Leu Phe Val Ser Ala Gin Gly 

190 200 

GAA GAC AAG CCT GTG TTG CTG AAG GAG ATT CCG GAA ACA CCA AAA CTC ATC ACA GGT AGT 721 
Glu Asp Lys Pro Val Leu Leu Lys Glu lie Pro Glu Thr Pro Lys Leu lie Thr Gly Ser 

210 220 

GAG ACC GAC CTC ATT TTC TTC TGG GAA AAA ATC AAC TCT AAG AAC TAC TTC ACA TCC GCA 781 
Glu Thr Asp Leu lie Phe Phe Trp Glu Lys lie Asn Ser Lys Asn Tyr Phe Thr Ser Ala 

230 240 

GCT TTC CCA GAG CTG TTA ATT GCC ACA AAA GAA CAA AGT CAG GTG CAC CTG GCA CGG GGA 841 
Ala Phe Pro Glu Leu Leu lie Ala Thr Lys Glu Gin Ser Gin Val His Leu Ala Arg Gly 

250 260 

CTG CCC TCC ATG ATA GAT TTC CAG ATA TCA TAA AAACAGCCTTATTTAGGAGTCTACTTTCTTGAGAA 909 
Leu Pro Ser Met lie Asp Phe Gin He Ser 

270 

GTGCTGACAATCTCTATGTACCATGTACAGGAAATGTTCCTCGTCCTAAGTCACTCGCATGGCATGTGCTGAGTCTTTG 988 

TAATTCTAAATGAATGTTTACCCTCTTTGTAAGAGAAGAGCAAAGCCTAGTGGAACCAGCCCGACATATGATACTGTTT 10G7 

GTTATTTAAAGAGTACCCTATAGTTTGCTCACTACTAATCATTTTAAf TATTATTCTTCATGGTATTCTTAGGAGGGTC 1 1 46 

AAAAAGACTCTGCrCATATTACAGATGGGCTAACTAAGGGATAAGACAACTGAAAAGTACACTCAATTGCATCTGGAAT 1225 

GGGA ATTCACAGACCCATGA AACTT ATCTCACTGTGCACCTTCTGCTTCT AA AGTGCCGGATGGGT AGGAT AAAGTTGT 1 30 4 

AAGAACGTAATGCTATCATTTTCAAAAGGAAGGGGACAATAGCTACATCTTTCCTACCTCAGTGGGTTTTATTCCAGTA I3B3 

AGAGCATTTGGATAAAATCCTTCTGTAACAGACCTCAAGAAGGAGACAGACTGTTGAATGTTATTTTTAAGTTATTTTA 1 462 

T AT ATGTATTTATAAATATATTTATGATAATTATATTATTTATGGCACATCCTTAAATCCTCTGAGCTTGCCAGGCATC 154 1 

CTCAGCAGCAGAATTTTCTAGGTGGTCAGTTAGATGCAGTGTCCTCTGAAACACCATGCTATAGACAGACGTACACTGT 1620 

TTCCACATCCCTGGAa^CTCTTTACATTCTTGTCCTTAGGAGCCCTGCAGTCATGATCACCAATCrGTACTCT 1699 

TCGTTCATTTAAAATGATAACTGGGTCAGTCTTTTGCCCT^ 1778 

ACTCACCTGTAATTAAAGCTAACCATCTAAAGACGGGAAGAACCTAACTGTGACAACCACATCCCTGTTACCTGAAGTT 1857 

TCTTTTCTAGGATGTAATCAGTGTTTCCTCTGGGTTCCAAATTTGTTCTCAAACCAGGGTGTCACAAAAATATCAACAA 1936 

TAATAATCAACTCATCATTACTATTGATCATAATTA AATAAAG CAAGTTTGAGCTGAAAAAAAAAAAAAAAAAflAAAAA 20 I 5 

AAAAAAAAAAAAAAAAAA 
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Fig. 3. Northern blot analy- 
sis of rat IL-la mRNA. RNA 

was extracted from each tissue of 
rats injected without ( — ) or with 
( + ) LPS. M<£ represents RNA 
from LPS -stimulated rat peri- 
toneal macrophages. Five micro- 
grams of poly(A) + RNA was sub- 
jected to Northern blot analysis 
with the rat IL-la cDNA probe. 



TABLE I. Expression of recombinant IL-la in monkey COS-1 
cells. Individual plasmids were transfected into COS-1 cells using the 
DEAE-dextran method. After 72 h of incubation, the culture medium 
was harvested. The GIF activity and LAF activity in the culture 

supe rnatant were measured. 

GIF activity (u/ml) LAF activity (u/ml) 



pcD-RT-IL-la-6 
pcDE 



46.9 
0 



8 
0 



and indistinguishable from each other on the basis of 
restriction map (Fig. 1). One of the clones, pcD-RT-IL-ltf- 
6, was chosen for the analysis of the nucleotide sequences. 
As shown in Fig. 2, this cDNA insert is 1,992 bp in length 
excluding the poly(A) + tail flanking 14 bp 3' to a typical 
poly(A) addition sequence (AATAAA) (41). Northern blot 
analysis showed that the rat IL-lar cDNA probe hybridized 
to a single -size mRNA of approximately 2.2 kb in the 
mRNA of LPS-stimulated rat peritoneal macrophages, 
which suggests that the size of cDNA insert is nearly 
full-length (Fig. 3). 

The cDNA insert encodes a protein of 270 amino acid 
residues by the comparison of the rat cDNA with human 
IL- 1 a cDN A. The nucleotide sequence of the coding regions 
of this cDNA and human IL-1 or cDNA is 73% homologous, 
while greater homology (78%) is found between the probe 
region of human IL-la and the corresponding region of rat 
IL-la*. The amino acid sequence deduced from this cDNA 
sequence is homologous to human IL-lar (65%). 

Expression of IL-la mRNA in Rat—It is known that 
IL-1 is produced from various cell types stimulated with 
LPS (1). The expression of IL-1 mRNA in various tissues 
in vivo, however, has not been reported. We investigated 
the expression of IL-la mRNA of brain, lung, spleen, and 
liver of rat treated or not treated with LPS. As shown in 
Fig. 3, the expression of rat IL-la- mRNA in vivo was not 
detected in the brain, spleen, lung, or liver of untreated rat 
(data not shown for liver) . Rat IL- 1 a mRNA was expressed 
in small amounts in the spleen of LPS -treated rat, but not 
in the lung, brain, or liver (data not shown for liver). 

Expression of Rat IL-la in Monkey COS-1 Cells— To 
test the biological activities of the protein encoded in 
pcD-RT-IL-la--6, it was transfected into monkey COS-1 
cells, and the culture supernatant was assayed for IL-1 
activity. The IL-1 activity was tested by measuring the 
LAF and GIF activities. As shown in Table I, the super- 
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Fig. 4. Construction of a plasmid expressing the mature form 
of rat IL-1 a. The construction is described in detail in "MATERIALS 
AND METHODS." The solid box on the rat IL-lar cDNA indicates the 
coding sequence for the mature protein while the open box indicates 
that for the precursor region. M.C.S., multiple cloning sites; s.s., 
single stranded. 



natant of cells transfected with pcD-RT-IL-lar-6 exhibited 
the LAF and GIF activities, while neither LAF nor GIF 
activity was found in the case of the control plasmid pcDE 
lacking the cDN A insert. These data confirm that the cDNA 
insert of pcD-RT-IL-1^-6 encodes rat IL-lar. 

Production of the Mature Form of Rat IL-la in E. coli 
— Lomedico et al have defined the amino -terminus of a 
subset of biologically active murine IL-lar at Ser 115 (40). 
By matching the amino acid sequences of murine IL- 1 a and 
rat IL-1 a- shown in Fig. 6, it corresponds to Ser 115 in the rat . 
IL-la sequence. We constructed the expression plasmid to 
direct the synthesis in E. coli of the carboxy terminal region 
encompassing the last 156 amino acids, beginning with the 
residue Ser 115 of rat IL-la-, to prove the biological activity 
residing in the carboxy terminal region of rat IL-l<ar. The 
structure and the construction of the expression plasmid 
ptrp-RT-IL-lor is shown in Fig. 4. When the E. coli JM109 
harbouring this plasmid was grown under conditions to 
derepress the tryptophan promoter, a large amount of rat 
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IL-ltf mature protein was synthesized in E. coli, as shown 
in Fig. 5. The cell extract exhibited a high level of IL-1 
activity, showing 5.6 X 10 6 units of GIF/ml of culture and 
2xl0 6 units of LAF/ml of culture. No IL-1 activity was 
found in cells transformed with the control plasmid pTMl 
lacking the rat lh-la sequence. 

DISCUSSION 

The cDNA of pcD-RT-IL- 1 a isolated with the 5'-end region 
of human IL-1 or cDNA as the probe from the cDNA library 
prepared with mRNA derived from LPS- stimulated rat 



a b c 




Fig, 5. The expression of recombinant rat IL-1 a on SDS- 
PAGE. E. coli cells grown for 9 h under the derepression of the trp 
promoter were collected and dissolved in Laemmli sample buffer (37) 
and electrophoresed in 15% polyacrylamide gel. The arrow shows the 
mature form of rat IL-1 or produced in E. coli. Lane 1, molecular 
weight markers; lane 2/ total protein from E. coli JM109/pTMl 
(without rat IL-1** cDNA); lane 3, total protein from E. coli JM109/ 
ptrp-RT-IL-lo- (with rat IL-1** cDNA). 



peritoneal macrophages encodes a protein which is homol- 
ogous to human IL-1 at in the amino acid sequence. It 
exhibited the GIF and LAF activities with the transfection 
into COS-1 cells. These data indicate that this cDNA 
encodes rat IL-1 or. 

The expression of IL-1 a 1 mRNA has been reported to be 
induced by LPS in peripheral blood monocytes ( 3) and with 
PMA and LPS in human histiocytic lymphoma cell line 
U937 (42). However, the level of IL-1 a mRNA is lower 
than that of IL-lyff mRNA induced simultaneously. On the 
other hand, keratinocytes have been demonstrated to 
contain more IL-1 a mRNA than IL-ljff mRNA in the 
absence of apparent stimulation (43). Our results shown in 
Fig. 3 indicate that IL-1 a mRNA is not induced in normal 
rat tissues nor even in tissues of LPS -treated rat except the 
spleen, although IL-l^ mRNA is induced in LPS- 
stimulated peritoneal macrophages- We have observed that 
IL-ly9 mRNA is induced in the brain, spleen, lung, and liver 
of LPS -treated rat (unpublished data). Therefore our data 
suggest that LPS is not sufficient to induce IL- 1 a mRNA in 
these organs, in contrast to monocytes. 

Fontana et al. have reported that IL-1 activity was 
detected in the brain of LPS -treated mice (44). Giulian and 
Lachman have reported that IL-1 activity was detected in 
injured rat brain (45). Recently Farrar et al. have de- 
scribed that they detected IL-1/? mRNA, but not IL-la 
mRNA, by in situ hybridization throughout the rat brain, 
but they did not describe what species of IL-1 cDNA probe 
they used and they have not observed the expression of 
IL-1 a mRNA in the brain of LPS-treated rat (46). The 
results of our experiment using rat IL-1 or and fi cDNA 
probes indicate that IL-l<r mRNA is not expressed con- 
stitutively nor induced in the brain (Fig. 3), but IL-1/? 
mRNA is induced in the brain of LPS-treated rat (unpub- 
lished results). Therefore IL-1 produced in the rat brain is 
thought to be the (3 type. It is important to understand the 



Rat MAKVPDLFEDLKNCYSENEEYSSAIDHLSLNQKSFYDASYGSLHENCTDK 50 

Murine D • T Q 50 

Rabbit ..... f . EP • * * D -MN • 50 

Human M D--S HV • • * P • • • G - M ■ Q 50 

Rat FVSLRTSETSKMSTFTFKESRVVVSATSNKGKILKKRRLSFNQPFTEDDL 100 

Murine N T SN * SET 100 

Rabbit V***S VSPNL • *Q* NV ■ A-T S L---I-DV*- 97 

Human S*-'SI T-KL M*-*AT N--V LS-SI-D--- 97 



Fig. 6. The comparison of the amino 
acid sequence of rat IL-1 a with IL-1 a 
sequences from murine (40), rabbit 
(4), and human (3, 4, 12). The letters 
represent the single-letter abbreviations 
for amino acids. The dots represent an 
amino acid residue that is identical to the 
residue shown for rat IL-1 or. The aster- 
isk shows the putative amino terminal 
amino acid residue of the mature form of 
rat IL- 1 a . The closed triangles indicate 
amino acid residues which are conserved 
among murine, rabbit, and human, but 
not in rat. Gaps ( — ) have been inserted 
to achieve maximum homology. 



Rat EAIAHDLEE-TIQPRSAPHSFQNNLRYKLIRIVKQEFIMNDSLNQNIYVD 14 9 

Murine QS-T - YTY • SD • • * - * MKL • R • K • V 14 9 

Rabbit -TNVS *P * • GI -K • • • V • YT • • R-M* • • YL* • I • * • • TL* • A • • • SLVR • 147 

Human • • • • N • S • • EI • K F • • LS • VK * NFM • • I • Y • • • L • • A • • " S ■ IR- 146 

T ▼ 

Rat MDRIHLKAASLNDLQLEVKFDMYAYSSGG-DDSKYPVTLKVSNTQLFVSA 19 8 

Murine V-KHY-STTW Q - I-DS 198 

Rabbit TSDQY-Q* - P-QN-GDA GV • -MTS-E • • IL -RI*Q-P 195 

Human ANDOY * T ■ * A • HN • DEA G • • -KSSK • • A • IT • I • RI • K • • ■ Y • T • 195 

T 

Rat QGEDKPVLLKEIPETPKLITGSETDLIFFWEKINSKNYFTSAAFPELLIA 24 8 

Murine • • * *Q L • KS y...p.. 248 

Rabbit *N-*E M RI-*D-*S-IL TQGN • *--K N *Q» F • • 245 

Human -D-'Q M--I--T N • L • • * * THGT V-H-N-F* • 245 

▼ 

Rat TKEQSQVHLARGLPSMIDFQIS 270 

Murine R T 270 

Rabbit • -PEHL- -M T 267 

Human • *QDYW*C - *G • P • • IT- • • • LENQA 271 
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difference of the expression of IL- 1 a and /? mRNAs in the 
brain. Uehara et al. have reported that human recombinant 
IL-lyS increased the plasma levels of ACTH in rats, 
whereas human recombinant IL-1 a did not (47) . It has also 
been demonstrated that recombinant human IL-1/? in- 
creased the ACTH content of the anterior pituitary and 
corticotropin releasing hormone (CRH) content of the 
hypothalamus and adrenal glands in rats, but recombinant 
human IL-1 a had no such effects (Y. Naitoh et al. t submit- 
ted). It is interesting that their data suggest the difference 
of the function between IL-ltf and IL-1/? in the brain, 
which might relate to the difference of expression of IL-1 a 
and P mRNAs in the brain. The availability of recombinant 
rat IL-ltf will contribute to confirming their data. 

The comparison of the amino acid sequence deduced from 
rat IL-la cDNA with those of murine, rabbit, and human 
IL-l^s is shown in Fig. 6. In amino : acid sequence, rat 
IL-l<r is homologous to human IL-la (65%), rabbit IL-la- 

When the amino acid sequence of rat IL-la- is divided 
into the precursor region (Met'-Arg 114 ) and the mature 
region (Ser 115 -Ser 270 ), the precursor region is much more 
homologous to human IL-ltf (74%), rabbit IL-1 a (71%), 
and murine IL-la (89%). We have no idea of the signifi- 
cance of the greater conservation of the precursor region 
than of the mature region in IL-l^. Recently it has been 
shown that both the precursor form and the mature form of 
human IL-la bind to the IL-1 receptor (48) and that 
membrane- associated, biologically active IL-1 is a type, 
not p type (49). These functions of IL-la might relate to 
the conservation of the precursor region in IL-la. 

There is another feature of interest in the amino acid 
sequence data: at four sites the same residue is found in the 
murine, rabbit, and human sequences, whereas there is a 
change in the rat gene (Fig. 6). This is due to the substitu- 
tion of one nucleotide in the codon between rat and murine. 
These amino acid residues lie in the mature region of rat 
IL-la and hence might not be essential for the biological 
activity. 

The rat IL-la, encoded in the cDNA, exhibited the IL-1 
activity upon the transfection into COS-1 cells, but its 
amino terminal amino acid has not been determined. How- 
ever, the mature rat IL-la beginning at Ser 115 expressed 
in E. coli exhibited the IL-1 activity. This recombinant 
IL-la from E. coli has been purified. Its specific activity is 
about 1 X 10 7 GIF units/mg protein and its amino terminus 
is a serine residue (Y. Masui et a/., unpublished results), 
which indicates that the carboxy terminal region of rat 
IL-la is sufficient for the IL-1 activity, as reported for 
murine IL-la (40) and human IL-la (3, 14). 

We measured the LAF activity as IL-1 activity. We also 
measured the GIF activity since recombinant human IL- 1 
a and /? are known to inhibit the growth of human 
melanoma cell line A375 (12, 38, 50). Recombinant rat 
IL-la has also the GIF activity, suggesting, that GIF 
activity is common to IL-ls derived from various sources. 

We have already obtained the recombinant rat IL - 1 a and 
the antibody against rat IL-la. These materials and the 
cDNA will contribute to clarifying the regulation of gene 
expression and the biological activities of IL- 1 in vivo. 

We thank Dr. Fumio Imamoto (The Institute of Physical and 
Chemical Research) for generously providing plasmid pTMl, and Mr. 
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THE RAT INTERLEUKIN-5 GENE: 
CHARACTERIZATION AND EXPRESSION BY 
RETROVIRAL GENE TRANSFER AND 
POLYMERASE CHAIN REACTION 

Klaus Uberia, 1 Weiqun Li, 1 Zhihai Qin, 1 Gunther Richter, 1 
Thomas Raabe, 2 Tibor Diamantstein, 1 Thomas Blankenstein 1 * 



The rat interleukin-5 (IL-5) gene was isolated from a genomic lambda phage library and a 
fragment containing all four exons was inserted into the retroviral vector pXTl, resulting in 
pXTRILS. Upon retroviral gene transfer into two IL-5-dependent mouse cell lines, B13 and 
T88M, autonomously growing cells were established and B-cell growth factor activity was 
detected in the supernatants of the infected cells. "cDNA" versions of the rat IL-5 gene were 
rescued by the polymerase chain reaction (PCR) with primers specific for the flanking 
regions of the cloning site in pXTl. Restriction or DNA sequence analysis of five different 
clones revealed precise splicing in two cases, while three of the clones had retained the first 
intron. In addition, in two of these about 400 bp of rat IL-5 '5' flanking regions were deleted. 
The sequence comparison of rat, mouse, and human IL-5 genes revealed a high degree of 
con^rv^on^ homologous ait the amino acid level). The 

combTnatiOT^ and PCR may offer an alternative, efficient method 

for the cloning of cDNAs. 
Copyright © 1991 by W.B. Saunders Company 



Interleukin-5 (IL-5) was originally described as 
B-cell growth factor (BCGF) or T-cell replacing factor 
(TRF). 1,2 Mouse and human IL-5 genes have been 
isolated and characterized at both the cDNA and the 
genomic level, 3 6 Comparison of rat and mouse interleu- 
kin-4 (IL4) genes recently revealed an unusual high 
degree of divergence. 7 Both interleukins, IL-5 and 
IL-4, are synthesized simultaneously and predomi- 
nantly by TH2 cells 8 and their genes are located close 
to each other on the same chromosome. 910 Both lym- 
phokines stimulate growth and affect differentiation of 
B cells. 11 In this report we wish to extend our compara- 
tive study of Iymphokine genes. In order to circumvent 
conventional cloning of the rat IL-5 cDNA we chose an 
alternative approach. Shimotohno and Ternin 12 re- 
ported the loss of intervening sequences from a retrovi- 
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ral vector upon retroviral passage. Subsequently, a 
number of groups demonstrated the possibility of 
generating a "cDNA" of genomic genes through retro- 
viral gene transfer. 13 ' 18 The rescue of such "cDNAs" 
was hampered by undetectable gene expression, 19 by 
inefficient or irregular splicing, 131519 and by time- 
consuming and laborious protocols. 1315 ' 17 Therefore, 
this approach did not seem to be useful for the 
generation of cDNAs of unknown genes. The introduc- 
tion of the polymerase chain reaction (PCR) technique 
into molecular biology 20 has facilitated the analysis of 
large numbers of different proviral integrations. The 
detection of completely spliced cDNAs should be 
possible, even without knowledge of the sequence 
which had been cloned into the retroviral vector. Using 
the combination of retroviral gene transfer and PCR 
we report here the isolation and characterization of the 
rat IL-5 "cDNA." 

RESULTS 

Identification and Isolation of the Rat IL-5 Gene 

EcoRI- and BamHI-digested liver DNA of Lewis 
rat and BALB/c mouse were analysed by Southern blot 
hybridization using a mouse IL-5 cDNA probe (Fig. 1). 
A single hybridizing EcoRI fragment of 8 kb and two 
BamHI fragments were obtained from rat DNA, which 
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Figure 1. Southern blot analysis of BALB/c mouse (1) and Lewis rat 
(2) liver DNA. 

EcoRI- and BamHI-digested DNAs were hybridized with the 630-bp 
BamHI-AccI mouse IL-5 cDNA fragment of pSP6K-mTRF23. Size 
marker was Hindlll-digested lambda DNA. 



is consistent with a single copy gene in the rat genome. 
Similar hybridization intensities in comparison to 
BALB/c indicated a high degree of homology between 
rat and mouse IL-5 genes. A genomic library was 
constructed with Lewis rat liver DNA, partially di- 
gested by Sau3 A. Recombinant plaques (1 x 10 6 )were 
screened with a mouse IL-5 cDNA probe. A clone was 
isolated which contained the 8-kb IL-5-specific EcoRI 
fragment. Further characterization by restriction and 
Southern blot analysis with mouse exon-specific probes 
indicated a genomic organization of the four exons 
similar to that in the mouse (Fig. 2). 

Construction ofpXTRILS and Retroviral Expression 
of the Rat IL-5 Gene 

A 4.1-kb AccI-HincII fragment containing some 
hundred base pairs of 5' flanking regions, all rat IL-5 
coding regions, and most of the 3' untranslated region 
was inserted by several manipulations into the Bglll 
cloning site of the retroviral vector pXTl, 21 resulting in 
pXTRILS (Fig. 2). This vector contains the neomycin 
gene as a selectable marker driven by the Moloney 
murine leukemia virus long terminal repeat (MoMuLV- 
LTR) and the rat IL-5 gene expressed from the 
internal herpes simplex virus-derived TK promoter. 
Psi 2 -virus-producing cell lines were generated by trans- 
fection with pXTRIL5. A viral stock of the clone 
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psi2-XTRIL5, which produces 1 x 10 4 neo R colony 
forming units (CFU)/ml, was used to infect the two cell 
lines B13 22 and T88M, 23 whose growth is dependent on 
exogenous IL-5. The infected cells were first selected 
for neo R and subsequently for IL-5 expression by 
omitting IL-5. From the cloning efficiency in the 
presence or absence of exogenous IL-5 we estimate 
that about 10% of the neo R B13 or T88M cells were 
capable of growing factor independently. Since we 
could not detect spontaneous factor-independently 
growing cells, we concluded that the majority of IL-5- 
independent B13 or T88M cells were infected by the 
retrovirus pXTRIL5, leading to the expression of the 
rat IL-5 gene. To test this assumption, culture superna- 
tants from five B13-derived (BR5) and eight T88M- 
derived (TR5) clones were assayed for IL-5 activity by 
a [ 3 H]-thymidine incorporation assay using T88M cells. 
The results for two BR5 and two TR5 clones are shown 
in Fig. 3 and demonstrate the presence of rat IL-5 
acting across the species barrier on mouse cells. The 
proliferation of the indicator cells was IL-5 dependent 
because it could be inhibited (up to 70%) by the 
anti-IL5 receptor monoclonal antibody (mAb) 
R52.120, 22 but not by the iso type-matched anti-IL-4 
mAb 11B11. In a control experiment, the same amount 
of 11B11 blocked the IL-4-dependent proliferation of 
CT.4S cells about 50%, whereas R52-120 had no effect, 
excluding a direct cytotoxic effect of R52-120. All 
clones tested expressed IL-5 activity. The bulk culture 
of BR5 cells secreted 0.3 U of IL-5 per milliliter and of 
TR5 cells 148 U of IL-5 per milliliter. Consistently, 
TR5 clones produced considerably more IL-5 than the 
BR5 clones. The psi2-XTRIL5 cells produced 2 U of 
IL-5 per milliliter, which should result from the expres- 
sion of the genomic rat IL-5 gene (not shown). 

Analysis of the Proviral Genome and Rescue of 
Retroviral IL-5 Sequences from BR5 and TR 5 Cells 

A Southern blot analysis was performed using the 
IL-5 probe, in order to examine the splicing of the rat 
IL-5 introns during the retroviral gene transfer. Upon 
EcoRI digestion a new 4.3-kb band which has the 
expected size of the retroviral "genomic" IL-5 frag- 
ment appears in psi2-XTRIL5 in addition to the 
endogenous 9.4- and 4-kb bands (Figs. 2, 4a). Other 
bands in psi 2 and psi2-XTRIL5 most likely result from 
cross- hybridization of pBR322 sequences in the probe 
with helper virus plasmid DNA in psi 2 . In all XTRIL5- 
infected cells (four TR5, two BR5 clones) a new, lower 
molecular weight EcoRI band appears in addition to 
the presumably not rearranged endogenous bands. 
The 1.9-kb EcoRI fragment in TR5-1 to TR5-4 corre- 
sponds to the expected size of a retroviral "cDNA" 
IL-5 fragment whose introns were spliced precisely in 
the virus-producing cell line, although the band in 
TR5-2 seemed slightly larger (see below). In BR5-2 the 



74 / Uberla et al 




NE, Vol. 3, No. 1 (January 1991: 72-81) 



XRIL5 



E HA He/ 



AHcP 



A He A B N A SP^ He E 



JLX 



il 




4.2 Kb 



pXTRILS 



U f 

u 



LTR pBR322 



w /// 



12 3 4 



1.9 Kb 



1Kb 



Figure 2. Schematic diagram showing the generation of a rat IL-5 "cDNA" by retroviral passage. 

From top to bottom: lambda phage containing the genomic rat IL-5 gene and (below) a detailed restriction map 
of the 8-kb EcoRI fragment harboring the coding region (black boxes) and most of the 3' untranslated region 
(open box). A 4.1-kb AccI-HincII fragment was cloned into the Bglll site of pXTl as described in Materials and 
Methods, giving rise to plasmid pXTRILS. Only those EcoRI restriction sites are shown that are indicative of a 
rearranged IL-5 gene. A virus producing fibroblast ceil line psi 2 was generated by transfection with pXTRILS and 
the virus was used to infect cell lines T88M and B13 from which IL-5 "cDNA" versions could be rescued. E, 
EcoRI; H, Hindlll; A, AccI; He, Hindi; P, PstI; B, BamHI; N, Ncol; S, Smal; LTR, long terminal repeat; neo, 
neomycin; TK-P, thymidine kinase promoter; SD, splice donor site; (SA), cryptic splice acceptor site. 



retroviral IL-5 band is 2.4 kb, suggesting different 
processing of the XTRIL5 mRNA. BR5-1 reveals 
additional 1.9- and 2.4-kb bands, indicating either that 
two viral integrations occurred or, what is more likely 
considering the reduced hybridization intensities of the 
additional bands, that BR5-1 is a mixture of two clones. 
In order to be sure that a functionally spliced IL-5 gene 
would be isolated from the genome of the infected 
cells, we wanted to confirm that only a single virus had 
been integrated. A Southern blot analysis with Bcll- 
digested DNA shows, in comparison to T88M, one 
additional IL-5 band for the four TR5 clones, which is 
consistent with a single proviral integration, since Bell 
cuts only once within the proviral genome (Fig. 4c). 

Next, fragments containing IL-5 sequences were 
rescued from XTRIL5 infected cells by PCR using 
primers specific for the TK promoter and MoMuLV 
sequences, which are located 5' and 3' of the Bglll 
cloning site in pXTl, respectively. In addition, the 
primers contain cloning sites for either EcoRI or 
BamHI. The amplification products obtained from 
DNA of TR5-1 and BR5-2 were analyzed by a South- 
ern blot using the IL-5 probe. After a 15 min exposure, 
bands appear corresponding in size to the retroviral 
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Figure 3. Cell proliferation assay measured by [ 3 H] -thymidine 
uptake using the IL-5 indicator cell line T88M in the presence of 
mouse IL-5 or diluted supernatants of cell lines as indicated. 

(1) IL-5-dependent cell proliferation; (2) in the presence of anti- 
IL-5R mAb R52.120 (1:500 diluted ascites); (3) in the presence of 
anti-IL-4 mAb 11B11 (1:500 diluted ascites). The black bars show 
the IL-4-dependent proliferation of cell line CT.4S in the presence 
or absence of antibodies. 
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Figure 4. Analysis and rescue of the rat IL-5 "cDNA" generated by retroviral passage. 

(A) EcoRI-digested DNA of indicated cell lines were hybridized in a Southern blot with the IL-5 probe For the 
interpretation of altered IL-5 bands in TR5 and BR5 clones in comparison to psi r XTRIL5 see also Fie 2 08) 
PCR analysis of XTRILS-infected T88M (TR5-1) and B13 cells (BR5-2). Amplification products from 50 ng of 
genomic and 80 pg of plasmid input DNA were electrophoresed on an agarose minigel after 26 cycles of 
amplification using primers specific for the TK promoter and MoMuLV sequences and subjected to Southern 
blot analysis using the IL-5 probe. Exposure time was 15 min. The size of the PCR products is 0.1 kb smaller than 
the corresponding EcoRI bands m (A) as indicated by the connecting arrows. (C) Bell-digested DNAs of T88M 
c" t i R5 T ceIIs wer 5 T anal yf e d *n a Southern blot using the IL-5 probe. The probe in A, B, and C was the 450-bp 
DNA m ° USe fragment of P SP6K-mTRF23, and the size marker was lambda Hindlll-digested 



IL-5 fragments of the respective parental lines (Fig. 
4b). Subsequently, the PCR-amplified rat IL-5 genes of 
five cell lines (TR5-1, TR5-2, TR5-3, BR5-2, BR5-3) 
were cloned into suitable vectors. 

Characterization of the Rat IL-5 "cDNA" 

The five rat IL-5 inserts were characterized by 
restriction analysis and partial sequence analysis. Clone 
TR5-1, containing the smallest proviral IL-5 fragment, 
was sequenced completely by the sequencing strategy 
outlined in Fig. 5. The sequence contains an open 
reading frame of 132 amino acids, which are 92% 
homologous to the mouse IL-5, 500 bp of 5' flanking 
region, and 0.7 kb of 3' untranslated region (Fig. 6). In 
addition, the sequence in Fig. 6 extends 0.3 kb in the 3' 
direction up to the EcoRI site (see Fig. 2) and ends at a 
position which is homologous to a region 20 bp 5' of the 
polyadenylation signal sequence in the mouse IL-5 
gene. Sequencing of parts of other rescued "cDNA's" 
and genomic regions confirmed the sequence and 
demonstrated the presence of three introns flanked by 
splice donor and acceptor sites. We also noted a C-to-T 
mutation in the second position of amino acid 126 in 
TR5-1, leading to a change from threonine to isoleu- 
cine and a G-to-A mutation at position 1011 (see Fig. 
6). The composite analysis of the five clones reveals 
three types of rat IL-5 sequences that have been 



rescued from the XTRIL5 infected cells (Fig. 5). 
TR5-1 and TR5-3 contain the expected, precisely 
spliced cDNA without deletions. Clone BR5-2 has 
retained the first intron. Similarly, TR5-2 and BR5-3 
still possess the first intron but, in addition, have 
deleted most of the 5' flanking region. In BR5-3 the 
deletion is 408 bp in length from nucleotides 60 to 467 
(see Fig. 6). No adjacent splicing recognition se- 
quences could be detected. 

Identification of a Retrotransposon Inserted into the 
First Intron of the Mouse IL-5 Gene After 
Divergence from the Rat 

Upon comparison of the overall genomic structure 
of human, rat, and mouse IL-5 genes and by sequenc- 
ing of the rat IL-5 intron 1 region we noticed a gap of 
303 bp within the rat sequence when compared to the 
mouse (Fig. 7). The insertion in mouse IL-5 intron 1 is 
flanked at either side by regions highly homologous 
between mouse and rat IL-5 genes and carries such 
features characteristic of retrotransposons as a 12-bp 
direct repeat most likely resulting from a target site 
duplication and a poly(A) tract at one end. The size of 
the first intron of the IL-5 gene in a number of 
laboratory and wild mice was analyzed in order to 
determine the time of the retrotransposon integration. 
To accomplish this, their genomic DNA was amplified 
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Figure 5. Schematic illustration showing the rat UL-5 regions 
rescued by PCR from five XTRIL5 infected cell lines. 

Numbers refer to exons (black boxes). 5' flanking and intron regions 
are shown as white and 3' untranslated region as dotted boxes. 
Arrows indicate the sequencing strategy. The asterisks indicate 
mutations identified in TR5-1. 



by PCR using primers specific for the first and the 
second exon of mouse and rat IL-5 genes (Fig. 8, top). 
The results show amplification products of about 950 
bp for all the genomic DNA samples with the exception 
of those of the mouse Mus coelomys pahari and the rats. 
This fragment is indicative of the presence of the 
retrotransposon. The 650-bp fragment obtained with 
rat DNA shows the absence of the retrotransposon. 
The 1400-bp fragment amplified with M.c. pahari DNA 
indicates that either additional or different mutation 
events have occurred in the IL-5 intron 1. To confirm 
that specific fragments had been amplified, the agarose 
gel was subjected to Southern blot analysis using a rat 
intron 1-specific probe, demonstrating the expected 
hybridization (Fig. 8, bottom). 

DISCUSSION 

Our results demonstrate the useful combination of 
retroviral gene transfer and PCR for the characteriza- 
tion of genomic and cDNA regions and the parallel 
expression of genes previously analyzed in other spe-. 
cies. This approach relies -on the fact that genomic 
libraries from a large number of different species are 
available, that a probe from a related species exists to 
isolate the genomic region, and that intron sequences 
are removed during the retroviral life cycle. Since the 



retroviral vector sequences ace known, the isolation of 
the "cDNA" is considerably facilitated by PCR, which 
furthermore restricts the generally time-consuming 
DNA sequence analysis to the informative exon re- 
gions. Following this experimental outline, we describe 
the isolation of the rat IL-5 gene, the generation and 
expression of its cDNA by retroviral gene transfer, the 
rescue of the cDNA by PCR, and the determination of 
its sequence. Target cells for the retroviral infection 
were IL-5-dependently growing cell lines, B13 and 
T88M. This made possible the selection of cells that 
had acquired a virus containing a functional IL-5 gene. 
In fact, all clones tested produced IL-5 between 2 and 
> 1,000 U. On the other hand, the number of virus 
particles transfering an aberrantly spliced IL-5 gene 
seemed to be low, since the frequency of rat IL-5- 
independent G418 R cells was similar to that of cells 
infected with a retrovirus that contained a mouse IL-5 
cDNA region (not shown). 

The rescue of the five rearranged proviral rat IL-5 
sequences shows two cases in which all three introns 
were spliced. In three other cases the first intron was 
retained but the second and third introns were deleted. 
Two of these suffer from a deletion of the rat IL-5 
"promoter" region. Despite the existence of these 
differently reassembled functional IL-5 regions, we 
could not detect a correlation between any unusual 
processing and level of IL-5 activity. Therefore, selec- 
tion presumably does not account for the identification 
of incomplete splicing of the first intron in three out of 
five cases. It could, rather, reflect a certain order 
during processing of the primary IL-5 transcript, or, 
alternatively, the preferential packaging of certain 
retroviral transcripts. To address this question in more 
detail, we are currently analyzing — again with the help 
of PCR — the content of retrovirus-specific transcripts 
which are generated in the virus-producing cell line, 
which are encapsicated and released into the culture 
medium, which are able to infect the target cell, and 
which contain a functional IL-5 gene. 

A disadvantage of the approach used is the high 
mutation rate arising from the use of the reverse 
transcriptase or, more likely, the Taq polymerase. 
Indeed, two mutations were detected in the sequence 
of TR5-1, one of which led to an amino acid exchange 
in position 126. The mutation rate of 10~ 3 (2/ -2000 
bp) is compatible with the error rate of the Taq 
polymerase and may be overcome by direct sequencing 
of the amplified products. 24 

The rat IL-5 gene was characterized because we 
recently noticed an unusual high divergence between 
mouse and rat IL-4 genes (57% amino acid homol- 
ogy). 7 Both lymphokines, IL-4 and IL-5, are simulta- 
neously synthesized by T lymphocytes preferentially of 
the TH2 type. 8 Both stimulate growth and affect 
differentiation of B cells. 11 In mouse and human the 
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Figure 6. Nucleotide seqeunce of the rat IL-5 
"cDNA" isolated from the cell line TR5-1. 

Coding regions have been deduced by comparison to 
the mouse IL-5 gene and are indicated by amino acid 
translation above the sequence. The sequence 3' of 
the Hindi site at position 1593-1603 was derived 
from the rat IL-5 genomic fragment. The position for 
consensus sequences of a TATA box is marked by a 
closed box, the transcription initiation site deduced 
from the mouse IL-5 gene is indicated by a horizon- 
tal arrow, and signal sequences for potential mRNA 
destabilization signals are underlined. The muta- 
tions in TR5-1 are shown below the sequence. 
Vertical arrows indicate the endpoints of the dele- 
tion in the promoter region of BR5-3. These se- 
quence data have been submitted to the EMBL/ 
Gene Bank Data Libraries under the accession 
numbers X54419. 
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IL-4 and IL-5 genes are located close to each other on 
the same chromosome. 9,10 In contrast to the IL-4 genes, 
the comparison of rat and mouse IL-5 genes shows 
their, strong eyolutionary^conseryation: both. consist of 
a. very similar exon/intron structure jind size and their 
exoa regions are 94 and 92% homologous to each other 
^ trinucleotide, and .the amino acid level, respectively 
(Fig. 9). Consistently with this is the fact that the rat 
IL-5 acts on mouse cells, whereas the IL-4 genes, which 
show low homology between mouse and rat, do not act 
across the species barrier. Rat and human IL-5 pro- 
teins are 68% homologous. The 500-bp presumed 
promotor regions are 83% homologous between mouse 
and rat. Twelve base pairs 5' of a potential TATA box 
the rat IL-5 gene contains a consensus region which is 
found within the promotor region of the hematopoetic 
growth-factors, granulocyte colony-stimulating factor 
(G-CSF), granulocyte-macrophage colony-stimulating 



factor (GM-CSF), and IL-5. 6 In the 3' untranslated 
part, the rat IL-5 sequence contains four ATTTA 
motifs which are highly conserved among cytokine 
genes and which may be involved in rapid degradation 
of the IL-5 mRNA. 25 

Upon comparison of mouse and rat IL-5 genes we 
noticed the insertion of a 303-bp fragment into the 
mouse intron 1 region. The presence of a target site 
duplication and a poly (A) tract support the assumption 
that the fragment is integrated by- retrotransposition. 
The retrotransposon is not homologous to any known 
sequence, except for a 110-bp region consisting of Cs 
and T's exclusively. Although abundantly present in 
the genome, this repetitive sequence has not yet been 
associated with retrotransposition. PCR amplification 
and size determination of the IL-5 intron 1 region in a 
number of different murine species indicated that the 
retrotransposon seems to be present in all species but 
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Figure 7. Comparison of human, rat, and mouse IL-5 genes. 

(Top) Comparison of genomic regions of human, rat, and mouse IL-5 genes. Coding regions are marked by black, 
3' untranslated regions by open boxes. The numbers of amino acids which encode the individual exons are 
indicated. The integration of a retrotransposon (RT) in mouse intron 1 is marked by a small open box. (Bottom) 
Comparison of mouse and rat IL-5 intron 1 sequences close to the region which flanks the retrotransposon in the 
mouse. The direct repeats are boxed, the potential polyadenylation signal is underlined. The poly(A) stretch is 
marked by an arrow. The distances to exon 1 and 2 in both mouse and rat genes are indicated. The sequence of rat 
IL-5 intron 1 has been submitted to the EMBUGene Bank Data libraries under the accession number X52845. 



the rat. Therefore, the retrotransposon should have 
integrated into the IL-5 intron 1 after mouse and rat 
diverged from each other but before most murine 
species separated from each other. 

MATERIALS AND METHODS 

Construction and Screening of a Rat Genomic 
Library 

Twelve- to twenty-kB enriched fragments of partially 
Sau3A-digested Lewis rat liver DNA were ligated into 
BamHI-digested EMBL3 arms and packaged in vitro result- 
ing in 1.5 x 10 6 recombinant phages. EMBL3 arms and 
Gigapack Gold packaging extracts were purchased from 
Stratagene, La Jolla, CA. Plaque hybridization to the 630-bp 
BamHI-AccI IL-5 fragment of plasmid pSP6K-mTRF23, 
lambda phage growth, plaque purification, lambda DNA 
extraction, restriction analysis, and subcloning of restriction 
fragments were done according to standard procedures. 26 

Generation of the Rat IL-5 Recombinant Retrovirus 

The retroviral vector pXTRILS was constructed by 
inserting a 4.1-kb AccI -Hindi fragment containing the rat 
IL-5 gene into the plasmid pXTl behind the thymidine 
kinase (TK) promoter. 21 First, a 0.6-kb AccI-HincII fragment 



containing 500 bp of the 5' region and part of exon 1 was 
blunt-ended with Klenow polymerase, inserted into HincII- 
Smal pretreated pBluescript plasmid, and excised as a 
BamHI -Hindi fragment. Second, a 3.5-kb Hindi fragment 
which contains parts of exon 1, exon 2, 3, and 4, and 700 bp of 
the 3' untranslated region, but which has the last 300 bp and 
the polyadenylation signal deleted, was cloned into the 
Hindi and Smal sites of pBluescript and recovered by Hindi 
and partial BamHI digestion. Finally, the 0.6-kb BamHI- 
Hindl 5' fragment, the 3.5-kb HincII-BamHI 3' rat IL-5 
fragment, and the Bglll-digested and dephosphorylated pXTl 
were ligated, resulting in pXTRILS. The cell line psi 2 27 was 
transfected with pXTRILS using a mammalian transfection 
kit (Stratagene, La Jolla, CA) and selected in medium 
containing 1 mg/ml of G418, and clones were established 
producing recombinant, helper-free viruses. The virus titers 
were measured by determination of the number of G418 (1 
mg/ml) resistant colonies (CFU/ml) after infection of cul- 
tures of 1 x 10 5 NIH 3T3 cells with serial dilutions of 
supernatants derived from 1 x 10 6 virus producing cells per 
milliliter cultured for 24 h. 

Cell Lines 

Psi 2 27 and NIH 3T3 cells were maintained in Dulbecco's 
Modified Eagles 1 Medium (DMEM) supplemented with 10% 
fetal calf serum (FCS) and 2 mM glutamine. Cell lines B13 22 



• 




Rat interleukin-5 gene / 79 



and T88M 23 were cultured in RPMI-1640 (Seromed) supple- 
mented with 10% FCS, 50 u,M 2-mercaptoethanol, and 20 
U/ml of IL-5. Both cell lines seem to represent immature B 
cells, and their growth is dependent on exogenous IL-5. 
CT.4S is an IL-4-dependently growing derivative of CTLL 28 
and has been cultured as described. 29 

Retroviral Infection 

Cells were cultured for 12 to 16 h in the presence of 
culture supernatants derived from psi 2 -XTRIL5 cells supple- 
mented with 4 u,g/ml polybrene and 20 U/ml of IL-5. 
Afterwards, the culture supernatant was substituted by fresh 
medium containing 1 mg/ml G418 to select for infected cells. 

Southern Blot Analysis 

High molecular weight liver DNA of a BALB/c, C3H, 
DBA/2, MOLF/EI (inbred Mus musculus molossinus), 
CAST/EI (Mus musculus castaneus), M. musculus musculus, 
M. spretus, M. caroli, M. cookii, M. pyromys saxicola, M. 
coelomys pahari mice, and Wistar and Lewis rats and DNA of 
cell lines, as indicated in the text, were extracted as de- 
scribed. 2630 Southern blot analysis was performed as previ- 
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Figure 8. Analysis of sizes of IL-5 intron 1 regions from various 
species. 

(Top) Analysis of the size of IL-5 intron 1 region of indicated animals 
by PCR using primers specific for exon 1 and exon 2. The tentative 
times of radiation are indicated. (Bottom) The same agarose gel was 
subjected to Southern blot analysis using a rat IL-5 intron 1-specific 
probe. The exposure time was 30 min. 
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Figure 9. Comparison of human, rat, and mouse IL-5 amino acid 
sequences. 

Identical residues are indicated by dashes and have been boxed. The 
exon regions are indicated above the sequences. 



ously described. 7 Either the nick-translated 630-bp BamHI- 
AccI or the 450-bp SacI-AccI fragment of plasmid pSP6K- 
mTRF23 3 was used as probe. 

Polymerase Chain Reaction 

The polymerase chain reaction was performed in a 
100-u,l volume containing 500 ng of genomic DNA, 0.1 u,M of 
each primer, 50 mM KC1, 10 mM Tris-HCI (pH 8.4 at RT), 
1.5 mM MgCl 2 , 100 p.g/m! gelatin, and 200 u,M of each 
deoxynucleotide triphosphate (dATP, dCTP, dGTP, dTTP). 
The template DNA was denatured for 5 min at 94°C, primer 
annealing was allowed to occur at 55°C for 1 min, 2 U of Taq 
polymerase (Amersham) were added, and the first primer 
extension proceeded at 72°C for VA min. Then, amplification 
occurred during 25 cycles of denaturing at 94°C (30 sec), 
annealing at 55°C (60 sec), and elongation at 72°C (90 sec). 
An aliquot of the amplified fragments was size-separated by 
agarose gel electrophoresis and specificity was controlled by 
Southern blot analysis using the IL-5 probe (see above). The 
following two pairs of primers have been used: (1) 5'- 
GG AATTCAACAGCGTGCCGCA-3 ' and 5'-GGAATTC- 
CCCTGGCGG ACGGG-3 ' and (2) 5 ' -GCG AATTCG AA- 
CACGCAGATGCAG-3' and 5 '-CGGATCCCCT- 
GGCGGACGGGAAGT-3'. Each set contains a primer 
which is specific for a region 5' of the Bglll cloning site in the 
TK promoter. 31 The corresponding second primer is specific 
for Moloney murine leukemia virus (MoMuLV) sequences 
located 3' of the Bglll site in pXTl. In addition, the primers 
contain recognition sequences for EcoRI or BamHI. An 
aliquot of the amplified fragment was digested with EcoRI 
(first primer pairs) or EcoRI/BamHI (second primer pairs), 
cloned into M13 vectors, and subjected to DNA sequence 
analysis. For the amplification of the first intron of murine 
IL-5 genes the following primer pair was used: 5'-ACTCT- 
C AGCTGTGTCTG G GCC A-3 ' and 5'-GTAGGGACAG- 
GAAGCCTCATCGT-3'. The polymerase chain reaction 
mix additionally contained 1 U of "perfect match" (Strata- 
gene). The annealing temperature was 60°C. 
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DNA Sequence Analysis 

Nucleotide sequence analysis was performed by the 
dideoxy chain termination method. 31 For the processing of 
DNA sequence data the DNASIS program (Pharmacia, 
Uppsala, Sweden) was used. 

Assay for IL-5 Activity 

The biological activity of IL-5 was measured by [ 3 H]- 
thymidine incorporation of T88M cells. Cells (5 x 10 3 cells/ 
0.2 ml) cultured in microtiter plates were incubated for 24 h 
with serial dilutions of IL-5-containing supernatants. Cells 
were pulse-labeled with 0.5 u,Ci/wdl [ 3 H]-thymidine during 
another 12-h culture period, and DNA synthesis was deter- 
mined by [ 3 H]-thymidine incorporation. One unit of IL-5 
activity was denned as the amount of IL-5 required for 
half-maximal proliferation. In some experiments, cell prolif- 
eration was determined in the presence of the rat anti-mouse- 
IL5-receptor mAb R52.120 (IgGl/K) 22 or the rat anti-mouse - 
IL4mAb 11B11 (IgGl/K). 32 
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Rat peritoneal exudate cells produce two interleukin 
6 (IL6) messenger RNA species, a major 1200 nucleo- 
tide and a 5 -fold less abundant, 2 400 -nucleotide spe- 
cies. A cDNA clone representing the major species was 
isolated, and sequenced. The 1055-base pair insert 
covered the 3 '-non translated region, the 211 triplet 
coding region and most of the 5'-nontranslated region. 
The derived rat IL6 amino acid sequence was 93 and 
58% identical, ^respectively, with mature murine and 
human IL6. Rat EL6 lacks iV-glycosylation sites but 
contains a fifth cysteinyl residue in addition to the 4 
residues shared in conserved positions with murine and 
human IL6. Stable murine L cell and human HeLa- 
derived cell lines were established by cotransfection 
with rat IL6 cDNA and a selectable neomycin resist- 
ance marker. These lines secrete 9- fold increased 
amounts of functional IL6 over their respective paren- 
tal cells. A stable rat macrophage- derived cell line, 
RM-SV1, was established by transformation with sim- 
ian virus 40. IL6 and 111 mRNA levels are inducible 
20- and 3. 5- fold, respectively, in this line by treatment 
with lipopoly saccharide with kinetics characteristic of 
macrophages. 

A set of three overlapping genomic DNA clones was 
isolated and a 10-kilobase DNA segment was se- 
quenced containing the rat IL.6 gene plus 2.9 kilobases 
of 5 '-flanking and 1.3 kilobases of 3 '-flanking se- 
quences. The two transcription start sites used in RM- 
SV1 cells were mapped within 5 base pairs of each 
other. The exon/intron boundaries are conserved with 
the murine and human IL6 genes. The two IL.6 mRNA 
species are generated by alternative polyadenylation 
at sites separated by a distance of 1.2 kilobases. The 
intervening region contains a repetitive element 72— 
80% identical with the rat and murine consensus LI 
family sequences. 



* This work was supported in part by Grants AI23351 and AI22166 
from the National Institute of Allergy and Infectious Diseases (to G. 
H. F.) and Grant AI27493 (to W. N.). This is publication number 
5764- 1 MM from the Department of Immunology, Research Institute 
of Scripps Clinic. The costs of publication of this article were defrayed 
in part by the payment of page charges. This article must therefore 
be hereby marked "advertisement" in accordance with 18 U.S.C. 
Section 1734 solely to indicate this fact. 

The nucleotide sequence(s) reported in this paper has been submitted 
to the GenBank™/EMBL Data Bank with accession numberis) 
Af 26744 and M26745. 

§ To whom correspondence should be addressed: Dept. of Immu- 
nology, IMM14, Research Institute of Scripps Clinic, 10666 N. Torrey 
Pines Rd., La Jolla, CA 92037. 



Interleukin 6 (IL6) 1 is a multifunctional cytokine with both 
differentiation and growth-promoting effects for a variety of 
target cell types. It is a B-cell stimulatory factor inducing 
terminal differentiation and high level antibody production 
in B lymphocytes (IL6/BSF-2, 1, 2), and it acts as a hybrid- 
oma/plasmacytoma growth factor (3, 4). It promotes growth 
of B- myeloma cells as well as T- lymphocyte growth and 
differentiation (1, 5, 6), but it inhibits the growth of primary 
rat hepatocytes in culture (7). The hormone stimulates the 
growth of hemopoietic stem cells, and in synergism with 
interleukin 3 (IL3) it supports the proliferation of cultured 
multipotential hemopoietic progenitor cells (8, 9). IL6 is a 
mediator of inflammation and a major alarm hormone signal- 
ing tissue damage and infection to the body's host defense 
system and in particular to the liver. Its ability to induce the 
synthesis of acute phase plasma proteins in hepatocytes is 
referred to as its hepatocyte stimulatory function (IL6/HSF, 
10; reviewed in 1, 11). This cytokine also causes fever in 
rabbits and humans and is therefore suspected to have a direct 
effect on cells of the nervous system (12). 

IL6 is produced by many different cell types including 
fibroblasts, macrophages/monocytes, T and B lymphocytes, 
endothelial cells, synovial cells, keratinocytes, and by a num- 
ber of tumor-derived cell lines (1, 11, 13-16) including several 
rat and human hepatoma cell lines (17, 18). Although the IL6 
gene is expressed at low levels in a large number of cell types 
(19), the cell types listed here and various tumor cells are the 
most abundant producers of IL6 in mammals. 

Expression of the IL6 gene can be induced by a number of 
different signals, including cytokines (interleukin 1, ILl; tu- 
mor necrosis factor a, platelet-derived growth factor, inter- 
feron f$ u and epidermal growth factor), serum, double- 
stranded polyribonucleotides, cycloheximide, phorbolesters, 
agents that increase intracellular cyclic AMP, viral infections, 
and bacterial endotoxins (1, 11, 20-26). Induction of the IL6 
gene is a very early event in the inflammatory response that 
leads to important secondary effects on the homeostatic de- 
fense system (52). In addition, it has been hypothesized that 
activation of the IL6 gene may not only be a secondary 
consequence of oncogenic growth transformation of cells but 
may itself be part of the initial events leading to oncogenic 
transformation (1, 11, 17). Therefore, it is important to un- 
derstand the molecular mechanisms causing the activation of 

1 The abbreviations used are: ILl, IL6, interleukin 1, interleukin 
6; HSF, hepatocyte stimulating factor; BSF-2; B-cell stimulatory 
factor (According to a nomenclature convention reached at a meeting 
on IL6 of the New York Academy of Sciences in December 1988, IL6/ 
HSF refers to the HSF function of IL6, IL6/BSF-2 to the BSF-2 
function, IL6/HGF to the HGF function, etc.); LPS, lipopolysaccha- 
ride; SDS, sodium dodecyl sulfate; PIPES, piperazine-/v* I 7V'-bis(2* 
ethanesulfonic acid); PECs, peritoneal exudate cells; SSC, standard 
sodium citrate; bp, base pairs; kb, kilobases. 
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this gene. As a first step in this direction, we have isolated 
and sequenced rat IL6 cDNA and genomic DNA clones. 

Here we report the gene and cDNA sequences and the 
cDNA-derived amino acid sequence of rat IL6 and compare 
it with its murine and human counterparts. We describe the 
establishment of a rat macrophage -derived cell line that was 
used as a convenient source of raRNA and as a tool for studies 
of the expression of this gene in macrophage-like cells. We 
also report the establishment of murine L cell and human 
HeLa-derived cell lines that express and secrete functionally 
active recombinant rat IL6. The exon/intron block structure 
of the gene has been determined, and we report the presence 
of two IL6 mRNA species derived from the single copy rat 
IL6 gene by alternative polyadenylation. 

EXPERIMENTAL PROCEDURES 

Animals and Materials — Male Fisher 344 rats (Simonsen Labora- 
tories, Gilroy, CA) were used for the preparation of primary peritoneal 
exudate cell cultures and RNA preparation. Laboratory chemicals 
and standard enzymes for molecular biology were from J. T. Baker 
Chemical Co., Mallinckrodt, Fisher, Sigma, New England Biolabs, 
Bethesda Research Laboratories, Stratagene, United States Biochem- 
icals, and Boehringer Mannheim as described (27). Fetal calf serum 
and tissue culture supplies were from GIBCO and Flow Laboratories, 
and radionucleotides from Amersham Corp. and ICN. Lipopolysac- 
charide was from Sigma. 

Peritoneal Exudate Celt Cultures and RNA Preparations — After 
euthanasia with C0 2 gas, the peritoneal lavage was performed by two 
consecutive intraperitoneal injections of 35 ml of serum free, ice-cold 
Dulbecco's modified minimal essential medium (Scripps Media Pro- 
duction) into rats. The lavage fluid was collected, and the peritoneal 
cells were recovered by low speed centrifugation. The cells were 
resuspended in serum- free medium, washed by another cycle of cen- 
trifugation and resuspension, then plated in tissue culture Petri dishes 
and allowed to attach for 4 h at 37 °C. Nonadherent cells were 
removed by two cycles of washing with phosphate -buffered saline, 
and the cells were supplied with Dulbecco's modified minimal essen- 
tial medium containing 10% fetal bovine serum and kept at 37 °C for 
20 h. Supernatants were then collected and assayed for IL6/HSF 
activity, and RNA was prepared by lysis of the cells in guanidinium 
thiocyanate and high speed sedimentation of the RNA through a 
cesium chloride cushion (28). Northern blot analysis was performed 
in agarose formaldehyde gels as previously described (29, 30). North- 
ern blots were hybridized in a solution containing 50% formamide, 
20 mM PIPES, pH 6.5, 0.8 M sodium chloride, 1 mM EDTA, 300 /ig/ 
ml of denatured salmon sperm DNA, and 0.5% sodium dodecyl sulfate 
(SDS) for 24 h at 42 *C. They were washed two times for 30 min in 
500 ml of wash solution at 42 *C. The first wash solution was 0.1% 
SDS containing 2 X standard sodium citrate (SSC), the second wash 
solution contained 0.1% SDS in 1 x SSC. SSC is 0.15 M sodium 
chloride, containing 0.015 m sodium citrate. 

Preparation and Screening of cDNA Libraries — The first cDNA 
library was produced using total RNA prepared as described above 
from adherent primary rat peritoneal exudate cells in the vector 
pGEXII, a commercial derivative (Genecell, Inc., San Diego) of the 
Okayama and Berg vector pcDV 1(31). Approximately 60,000 colonies 
of a cDNA library consisting of 40,000 independent clones were 
hybridized with a murine IL6 cDNA probe (32). The second cDNA 
library was produced from polyadenylated mRNA prepared from the 
rat macrophage cell line RM-SVl (see "Results ") in the commercial 
vector AZAPII (Stratagene, Inc., San Diego). The second library was 
screened with the rat IL6 cDNA clone pIL6C.9l isolated from the 
first library and with a 1.22-kb Sacl/BamHl genomic DNA fragment 
from the 3' end of the gene (see "Results"). 

DNA Sequencing and Computer-assisted Data Analysis — Random 
subfragments of cDNA and genomic DNA clones were generated by 
sonication and were subcloned into M13 phage vectors. DNA sequenc- 
ing with the dideoxynucleotide technique and sequenase enzyme 
(USB) followed standard procedures (34). Sequencing data were 
collected and analyzed using Staden's database system (35) and the 
University of Wisconsin Genetics Computer Group programs (36) on 
a Digital VAX 11/750 computer. The genomic DNA sequence of 
10,067 nucleotides was derived from a total database of 63,252 nucle- 
otides. The average nucleotide of the sequence was covered by 6.3 
-independent-sequence-gel-readingS7-and-100%-of-the sequence was — 
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covered on both strands.. The sequence of the cDNA clone represent- 
ing the major I.2-kb mRNA species with an insert of 1,055 base pairs 
(bp) was derived from a total database of 2,514 characters. Only 85% 
were covered on both strands, but the sequence was matched com- 
pletely by the previously established genomic DNA sequence. The 
cDNA representing the 2.4 -kb mRNA species had an insert length of 
1,909 nucleotides and was covered by a total database of 6,343 
nucleotides. Its sequence was matched completely by the genomic 
DNA sequence and by the sequence of the cDNA clone representing 
the 1.2-kb mRNA species. For some cDNA clones the terminal 
sequences of the inserts were determined using the double-stranded 
plasmid DNA sequencing method (37). 

Biological Assay for ILS/HSF Function — A novel indicator cell line 
for IL6/HSF function and a novel assay were established (17). 2 The 
indicator line FaO/HB3 is a derivative of the well -characterized rat 
hepatoma cell line FaO, which was previously shown to respond to 
treatment with IL6/HSF by increased transcription of the a 2 -mac- 
roglobulin gene and other characteristic acute phase genes (17, 38). 
The FaO/HB3 line was established by cotransfection of FaO cells 
with a plasmid carrying the 5' -flanking control region of the rat a 2 - 
macroglobulin gene (17, 39, 40) directing the expression of the firefly 
luciferase gene (41) and a neomycin (G418) resistance marker. FaO/ 
HB3 cells respond to treatment with IL6/HSF by production of 
luciferase activity. For the assay, FaO/HB3 cells were grown to 
confluence, then split 1:1 and allowed to grow overnight (12 h) in 
presence of 10~ 7 M dexamethasone. The cells were then treated for 
4-5 h with various test supernatants of cultured cells or with known 
concentrations of purified recombinant human IL6 produced in Esch- 
erichia coli. This material, with a specific activity of 5 X 10 6 IL6/ 
BSF-2 units/mg, was obtained from Dr. Asagoe and colleagues, Tosoh 
Inc., Japan and Drs. Hirano and Kishimoto, Osaka, Japan. FaO/HB3 
cell extracts were prepared according to a standard procedure 3 and 
used for luciferase assays in a luminometer (Monolight 2001; Analyt- 
ical Bioluminescence, Inc., San Diego, CA) following the instructions 
of the supplier (41). The protein concentrations in the extracts were 
determined with a color assay (Bio-Rad), and the luciferase activities 
were normalized to constant amounts of extracts, usually to 200 fig 
of protein (see "Results," Table I). The luciferase activities were then 
converted to IL6/BSF-2 units using a calibration with known 
amounts of recombinant human IL6. 

Screening Genomic Libraries and Characterization of Genomic DNA 
Clones — A rat genomic DNA library, constructed with EcoKl partial 
restriction fragments in the phage vector A charon 4A (42) was 
screened with the rat IL6 cDNA clone pRIL6C.9l. Screening and 
characterization of the isolated clones by restriction enzyme cleavage 
mapping were performed as described (39). 

RNase Protection Assays and RNase H Experiments — To map the 
5' ends of IL6 mRNA by RNase protection, a 1.205-kb Hindlll/ 
BstXl genomic DNA fragment containing exon 1 (see "Results," Fig. 
9) was blunt-ended with T4 DNA polymerase and subcloned into the 
Smal site in the polylinker of the Bluescript® KS vector (Stratagene). 
This construct linearized with Bom HI was used to generate an 
internally radiolabeled RNA copy of genomic DNA sequence, com- 
plementary to IL6 mRNA. The construct was copied with T7 polym- 
erase following a protocol provided by the supplier (Stratagene). Two 
fig of plasmid DNA were suspended in a 50-//1 reaction volume, 
containing 40 mM Tris-HCl, pH 8, 8 mM magnesium chloride, 2 mM 
spermidine, 50 mM sodium chloride, 0.4 mM each of ribo-ATP, ribo- 
CTP, ribo-GTP and 40 n"M of UTP, 150 »Ci of [ M P]UTP (400-800 
Ci/mM, Amersham Corp.), 30 mM dithiothreitol, 50 units of RNasein 
(Promega) and 10 units of T7 RNA polymerase (Stratagene). The 
copying reaction was carried out for 45 min at 37 °C. The reaction 
mixture was then treated with 20 units of RNase-free DNase I 
(Stratagene) for 15 min at 37 *C. The reaction product was freed of 
unincorporated nucleotides by two cycles of addition of an equal 
volume of 4 M ammonium acetate and precipitation with 2 volumes 
of ethanol (27). The final T7 polymerase product was dried and 
redissolved in 50 m1 of H a O. For the annealing reaction, 12 fig of total 
RNA (extracted from the LPS-stimulated macrophage-like cell line 
RM-SVl, Results) were combined with 1 ^1 of the T7 polymerase 
product and the RNA was denatured in a 30-^1 volume containing 
80% formamide for 10 min at 75 *C. Then 3 ^1 of 10 x PIPES buffer 
(100 mM PIPES, pH 6.4, 4 m sodium chloride, 10 mM EDTA) were 
added and annealing was performed for 16 h at 45 *C. The reaction 
mixture was chilled on ice, and 300 nl of RNase buffer were added, 

2 M. Hattori and G. H. Fey, unpublished results. 

- 3 -M-Hattori;-unpubHshed-results: — 
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containing 10 mM Tris-HCl, pH 7.5, 5 mM EDTA, 200 mM sodium 
chloride, and 100 mM lithium chloride. RNase A (Boehringer. Pan- 
creatic RNase) was added to a final concentration of 100 pg/ml, and 
1000 units of RNase Tl (Bethesda Research Laboratories) were 
added. Incubation was allowed to proceed for 30 min at 37 "C. SDS 
was added to a final concentration of 0.3%, and 10 /d of a proteinase 
K (Boehringer Mannheim) solution containing 10 mg/ml were added. 
The reaction mixture was incubated for 25 min at 37 *C, and finally 
the RNase-resistant RNA products were extracted once wkh phenol, 
precipitated and washed with ethanol, and lyophilized to dryness. 
They were resuspended in 6 *d of formamide sample buffer (95% 
formamide, containing 20 mM EDTA, 0.05% bromphenol blue 
(Sigma), 0.05% Xylene Cyanole FF (Eastman Kodak)), and analyzed 
by electrophoresis in a 6% polyacrylamide urea sequencing gel (27, 
34) and autoradiography. RNase H experiments were performed as 
described (29, 30, 39). 

Southern Blot Analysis of Genomic DNA — The analysis was per- 
formed as described (39). 

RESULTS 

Rat Peritoneal Exudate Cells Produce Two Discrete Classes 
ofIL6 mRNA — Northern blot analysis of RNA extracted from 
primary cultures of adherent rat peritoneal exudate cells 
(PECs) with a cloned murine IL6 cD NA probe (32) revealed 
the following two size classes of IL6 mRNA: a "major" class 
of 1.2 to 1.3 kb and a "minor" class of 2.4 kb (Fig. 1). Judged 
by quantitative densitometry of the autoradiography, the mi- 
nor class was approximately five times less abundant than 
the major class. 

Isolation and Sequence Analysis of Rat IL6 cDNA Clones — 
Total RNA was extracted from primary cultures of adherent 
rat PECs, stimulated by treatment in culture with LPS (see 
Experimental Procedures"). A cDNA library was constructed 
in the vector pGEXII (31) and screened with a cloned murine 
IL6 cDNA probe (32). Fourteen candidate cDNA clones were 
isolated from approximately 60,000 colonies plated and ana- 
lyzed by restriction fragment length analysis (agarose gel 
electrophoresis). Further work was performed with clone 
pRIL6C.91, which carried the longest cDNA insert. The insert 
of this clone (1,055 bp, Fig. 2), covers most of the 5' -nontrans- 
lated region, the protein coding region, and all of the 3'- 
nontranslated region. The coding sequence specifies an open 
reading frame of 211 triplets (Fig. 2). The attachment site for 




Fu;. 1. Primary cultures of adherent rat PECs produce two 
IL.6 mRNA species of distinct size. Primary cultures of adherent 
rat PECs were produced and RNA was extracted as described under 
"Experimental Procedures." Twenty fig of total RNA were elect ro- 
phoretically separated in an agarose formaldehyde gel and transferred 
to nylon membranes. The blot was hybridized with a cloned murine 
1 LG cD NA prob e (32). Size evaluation was made by comparison with 
The mobilities orRNA^i^lriarkers (not shown). 
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FiG. 2. cDNA sequence and derived amino acid sequence of 
rat IL6. The cDNA sequence and the derived amino acid sequence 
of the insert of clone pRIL6C.91 is shown. The position of the 
polyadenylation site was deduced by comparison with the genomic 
DNA sequence. The polyadenylation signal AATGAA (see "Discus- 
sion*') is underlined, the asterisk locates the translational stop codon 
{TAG). The block structure shown below represents the IL6 cDNA. 
A single large open reading frame (ATG-TAG) encodes a precursor 
protein with 21 1 amino acids. The black box corresponds to the mature 
protein with 187 amino acids, the dotted box to the signal peptide 
with 24 amino acids, and the white boxes indicate the 5'* and 3'- 
nontranslated regions, respectively. The numbers correspond to the 
nucleotide sequence. 

the poly{A) tail was deduced by comparison of cDNA and 
genomic DNA sequences (see below). From the first cDNA 
library, no clones representing the 2.4-kb mRNA species were 
obtained, but subsequent screening of a second library led to 
the isolation and sequence analysis of such clones (see below). 

Properties of the Mature Rat IL6 Protein, Deduced from 
cDNA Sequence — The cDNA-de rived amino acid sequence of 
mature rat I.L6 is compared in Fig. 3 with the murine and 
human IL6 protein sequences. The rat and human cDNAs 
are 68% identical, and the corresponding rat and murine 
cDNA sequences are 92% identical. The rat and human 
protein sequences are 58% identical and the rat and murine 
protein sequences 93%. Murine, rat, and human IL6 contain 
4 cysteinyl residues in conserved positions, and rat IL6 has 
an additional cysteine at position 88 (Fig. 3). The rat JL6 
sequence carries no obvious TV-glycosylation sites. The assign- 
ment of the NH L > terminus of mature rat IL6 is based on a 
comparison with the known NH 2 termini of murine ILG. 

Development of Stable Mammalian Cell Lines Expressing 
7?aP/ra=T6~Hemonstrate thiTthe cDN"A~cIone pRI C6C79l~ 
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Fig. 3. Comparison of human, 
rat, and murine mature IL6 amino 
acid sequences. The human and mu- 
rine sequences are as published (2. 32, 
33). The NH>j termini of murine and 
human IL6 have been determined by 
sequence analysis of the purified mature 
proteins (2, 43, 44). The NHa terminus 
of rat \1S is deduced by sequence com- 
parison with murine IL6. Identical resi- 
dues are boxed. Gaps (dashes) were in- 
troduced as suggested by the computer 
program BESTFIT of the University of 
Wisconsin package (36) to produce op- 
timal alignments. 
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FiG. 4, Induction of IL1 and IL6 mRNA levels in the cell 
line RM-SV1 by treatment with LPS. RM-SVl cells were grown 
in the presence of 0.o% fetal bovine serum. They were incubated for 
various time intervals with 1 Mg/ml of LPS from E. coli. Total RNA 
was then extracted and aliquots of 10 pg of RNA were deposited on 
nylon membranes and covalently fixed by UV irradiation. Duplicate 
filter sets were separately hybridized with a nick -translated rat IL6 
cDNA probe (pRIL6C.91) and a murine ILl cDNA probe (51). Au- 
to radiographs were evaluated by automated quantitative scanning 
densitometry. Open circles, ILl mRNA; dosed circles, IL6 mRNA. 

codes for biologically active IL6, it was used for DNA trans- 
fection into mammalian cells. The transfected cells secreted 
significantly more biologically active IL6 than the untrans- 
fect.ed parental cells. To this effect, the cDNA insert of clone 
pRI.L6C.91 was subcloned into the expression vector pcDL- 
SR296 designed for high level expression in mammalian cells 
(45). DNA of this subclone, called pR!L6C92, was transfected 
.into murine L cells and human HeLa cells. Cotransfections 
were performed with a neomycin (0418) resistance marker, 
and G418-resistant colonies were selected. Stable cell lines 
were established by reiterated rounds of single cell subcloning, 
and the lines MLTK-RIL6.92 and HeLa-RIL6.92 contain 
stably associated rat 1L6 cDNA sequences as judged by their 
ability to produce constant levels of IL6 activity (see below) 
lor over 20 passages. 

For comparison, an immortalized rat PEC-deriyed cell line, 
was also established. To this effect, primary cultures of ad- 
herein rat PECs were infected with the DNA tumor virus 



Table J 

Accumulation of biologically active IL6/HSF in various 
culture super natanls 

IL6/HSF activity was assayed with the FaO/HB3 cell line using 
the luciferase assay (Experimental Procedures). 





Luciferase 
activi ty" 


IL6 
activity* 


RM-SVl 


2,882 


110 


RM-SVl + 10 Mg/mi LPS' . 


1 8,964 


873 


MLTK rf 


326 


11 


■ MLTK-RIL6.92 


2,1.14 


95 


HeLa*' 


487 


19 


HeLa-RlL6.92 


3,781 


175 


rhu IL6' 


2 T 317 


100 


rhu IL6 


7,000 


300 


rhu 1L6 


15,092 


750 



° Light counts integrated over a 30- s interval in a standard assay 
(see "Experimental Procedures") normalized for 200 jig of protein 
extract of FaO/HB3 cells. Basal level ao-macroglobulin promoter 
activity obtained by culturing FaO/HB3 cells with media alone (696 
counts) was subtracted from all activity values in this column. 

6 JL6/BSF-2 units/ml, as determined from calibration with recom- 
binant human JL6. 

' Treatment with LPS for 20 h. All values in this table are activities 
accumulated in a 20 -h period. 

rf Control cell lines transfected with neomycin resistant vector only. 

' Recombinant human IL6. 

SV40 at a multiplicity of approximately 0.1 plaque forming 
units/cell. Foci of growing cells developed 2 weeks after infec- 
tion and well- isolated foci were expanded into cell lines by 
multiple rounds of subcloning. The cell line RM-SVl, estab- 
lished in this manner, displays macrophage-like cellular mor- 
phology and other traits characteristic of highly differentiated 
macrophages. In particular, the mRNAs coding for ILl and 
IL6 can be induced by stimulation of this line with LPS {Fig. 
4). 1L6 mRNA levels were increased 20- fold with a maximum 
at 4 h after stimulation, and ILl mRNA levels were increased 
3.5- fold with a maximum at 5 h. 

The cDNA Clone pRJLSC.92 Codes for Biologically Active 
I L6/HSF— Culture supernatants of the cell lines MLTK- 
RIL6.92 and HeLA-RIL6.92 as well as of LPS-st:imulated 
RM-SVl cells and corresponding controls were assayed for 
the presence of biologically active 1L6 by using the indicator 
cell line FaO/HB3. This line is a sensitive tool to monitor the 
HSF. function of IL6. It converts an IL6/HSF input signal 
rapidl y into lu cif era s e activi ty, w h i c h can b e measured wit h 
high precision (see "Experimenial Procedures"). Using this 
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1 AATTCTTCAA ATGGGAAACC TGTCAAGATA CAGAGAAAAG AATGTATAAT GGTAGATAGA GrTTAGAGTT TAGAGTTTAG AGTTACTGTG CTAATATTTT 
102 GGTTTAAGAA GAGTGTTTTT CTCCCATAAA CAAATGCCAA TCAAAGGCAG TGGTGCTAAT AAAGATCCAT GCTGGTGACA TACATACAAA GGAACAAGAC 
201 AGAATCACCC TGCCTTCCAA ACCTTGAATA CTACAAACTT AAGCTCTCCG GTTGTCAAGG CCATTTATGC TTCAGAATTA TGCTCAGAAA CTTGAAGGAT 
301 GATGATATGT ATAGACTGTT AATTCTCCTA TAAAAAAAAC TTAGCAGAAT ACTTTGGGAA AACTCTAGGA GTAAAGCACA GCTGGCTAAT TAGTGAGTAA 
401 GGGACCTAGT TTGAGTATGG CTTTGTCTGC GTGTGTGAAT GTGCGTCACG TATATTCAGT GCCAACTGAG GCTATAAGAG GGCATCAGAT ACCCAGGAAC 
501 AAGAGTTACA GATGATTTAG AACCACGATC TAGAAACTGA ACCCACATCC TCTTGAAAGA GCAATAGATG ACCTTAACGT CCAAGTCATC TGTTCAGCTC 
601 CTCATAATGA ATTTTCAATA GCACCACCAA TCTCATTTCT TTAGGGATAA AATGGTTCAG ACACTAACTT CATCCTTGGT ACAGTAAGCT GGTTGTTCTC 
701 CATAGTAACT TCACAAGCAA GGTCCCAGGT GAACTGATAG TTATGCCAGA AAAGCTCCCT GAGGGAGAAG TTCCAGCTCT TAAAAATAGT AGAGTTGGTG 
801 CTTACTCTTT CTTAGTTATC TTTACGGCAG CTCTGTATGA TGACCTAGTA TTATTTTCCT GTTAAAGAGT AAATAAACAA GAACACAGAT CTTGGGCTAA 
901 TTTAGCCACT AATAAACAGC TAGCAAATGG TGGCATTGGA CTTCTAACCC AAATTATTAC TCAAAAGGCT ATCGTCTCCA CATGTACCCT GCCCCCACCC 
1001 CCCACCTCCA TGAAAAGCGA TATAGACATC ATCCAGACCA TTAACATACA GTGTGTATCT CTATGTATAA ACATGCATTT AGAGAGATAT ATGTGTGAAA 
1101 TGTATATGTA TAAATATATG AGCTGAGAAA ACCTCTTCCA GATGAACCTG ACAAGAGTTC CACCGAAACT A CTGG GGTTT AGGCACCACT TGTAGGACCA 

12 01 TTACTTTCCT CTGTTGTCTT GTTTGAACAT TTTATCATGG ATGCATGCCT CCTACTTAAA GACCATTTTT TTAAAAATTG AAAACAGAAA TTTCAATGTT 

13 01 GTAGTTAGTC TAACAGTCAA GCTTATTGAG TCATTTAAAA ATTGTTAAGT GGAGAAAAGA AACTCTATAC CTTAGAGACC TGAGACAACA CATGGCAGGG 
1401 AGTAGGGGAA GTAGAGACAT ACTGAGTTCC AGGCTATTTC AGCTGCCATT TTAAGGTCCA AGGGCACTAA AAGTAAAACC ATCCTGCTTT CTCTCCCCCA 
1S01 TTTC CATTTT CAACTACAAT CCCCTAGAAC CTATGTGAGG ACTAGTGCCT CACACAG TGA AAGAATGGGA AACCTCACTT CTTCAATAGG CAGTTTCTCC 
1601 TTTATGTACA CAAACAAGAG TCCACCCATA TTTTAACTGT CCCACCAACA GTCATTGCTT TCAGAGTCTT AAGGCAATCT GCAAAGTGAA GAAACTGATT 
1701 TCGATGCTAA GACTCTGATT TG TAAATAG T TTTGGC TTTA TATACAGAGC CTACTTTCAG CCTGTAATCA TTTTGAGTGC TATCTAGGTA TCTGGAGACA 
1801 GGTGGACAGA AAACCAGGGA CTAATCTGTA AAATAAATTG AACAAAGGGA AGAAGTCTGT TTAAGCTTGA CCCACCCTAG AAG TCTCGGA CATTAGAAGG 
1901 CTCATTCTGA CTCAGACATG CCACCTTTAG ATTCAAATCC TGTCACCCAG TTAAAGGGAG CTTCAAACAT CTAGCTACGA TACAACGAGG TCCTTCTTTG 
2001 CTGTAAACCC TGAATAAACG TAAGGACTGC AACAAACCTT CAAGTCTCCT TGCATGACCT CGAAATGTTT TCGGCATGTT CTGGCAGCAC TGGGATCAGC 
2101 ACTAACAAAT AAGGCAACTC TATTACAGAA ACTAAAAGTC TTACTAGTAA GATGGCCAAG AGACCACTAG GGAGAAATCA GAAAGTGGAC TTGGACTTGA 
2201 GAGGAAACCA GGAGTGCTTG ATGACAGAAG ATATTTCTCT TCTTCACTCC ATAGTAGCCC ACCTCGCAAC TCCTCGAAAC AACTGCACAA AATTTGGAGG 
2 301 TGAACAAACC ATTAGAAGCA GCTGGTCCTG ATAACATGCA GGAAGAACAA GCAATATGCA GCAGGCATCA CTACTGTCTG TGTGACTCCA GGTCAGGTGC 
2 401 TGGGCGTGGA CTGGGAGTGG GAGTGTGTGT CTTTCTATGA TCTGAAGAAA CTTAGGTCAG AACATCTTTA GATTCTCACA GACATGAAAA AATTAGAAGA 
2501 GTAAATCCTG CCCTCTTATT AATTCATGAG TGTTTGTGTG TGCGCACATG TGTTTAAATA ACATCAGCTT TAGCTTCACT TTCTCCTTAT AAAACATTGT 
2601 GCATTTCAGT TTTTCCCCCT ATCAAGTGCT CAAGTGCTGA GTCACTTTTA AAGAAAGAAA AAGAG TGAT C AGGCTTC TTA AGGATAGCCT CAAGGATGAC 
27 01 TTAAACACAC TTTCCCCCTC CTAGCTGTGA TTCTTTGGAT GCTAAATGAC GTCACATTGT GCAATCTTAA TAAGGTTTCC AATCAGCCCC ACCCACTCTG 

TATA 

2 801 GCCCCACCCC CACCCTCCAA CAAAGATTTT TATCAAATGT GGCATTTTCC CATGAGTCTC AAAAGTAGAG AGTCGACTCC CAA TAAATAT GAGACTCGGG 

m M 

2901 ATCTCTGTAG CTCATTCTGT CTCGAGCCCA CCACGAACGA AAGTCAACTC CATCTCCCCT TCAGGAACAG CTATGAAGTT TCTCTCCGCA AGTAAGTGAA tX 1 



3001 GGCAGTTTCT CGCCCTCTGG CGGAGCTATT GAGACTGTGA GAGGAGTGTG ACGCAGGGAG CCAGCCAAGT GGGTTGGCTA GCAGCCAGCA GCCAGCATCA 
3101 GGCGCCCAGC TGCGCTATTT TCTCATTTGC TTCTTTCCCT TTCCTCCACG CAGGAGACTT CCAGCCAGTT GCCTTCTTGG GACTGATGTT GTTGACAGCC Ex2 



3201 


ACTGCCTTCC 


CTACTTCACA 


AGTCCGGAGA 


GGAGACTTCA 


CAGAGGATAC 


CACCCACAAC 


AGACCAGTAT 


ATA C CACTTC 


A CAAG TCGGA 


GGCTTAATTA 


3301 


CATATGTTCT 


CAGGGAGATC 


TTGGAAATGA 


GAAAAGAGGT 


GGGTAGGCTG 


TGAAAGTGAT 


GACGGCCCAA 


TGTGGGCATC 


CATTCATTCT 


CTTTGCTCCT 


3401 


GAA TTGGGAA 


TTCTCTGCTG 


GGTTCTAGAG 


CCCTTTGGAT 


TTGAAGCTAA 


AGATCACACT 


AGACTTCTAT 


TCTCTCTCTC 


TCTTCTCTTT 


GCTGCTTTTG 


3501 


GGTGGGAGTT 


GTGTGTGTGT 


GGTTCCTTCC 


CTGTCTGGAA 


GATACAGAAT 


GTGAACTGCA 


TTTCTAGAAA 


ATTCACAGTG 


GACCATTCTC 


AGTTCCAATT 


3601 


TAGTTGTTGT 


TTCTTGGGTA 


GGCATTCCTT 


TTTCTTTATC 


AAAGAGTAAG 


ATCTCTTGGC 


CTTCATTTGT 


CCTTTGAACT 


CTACAAGGAC 


CAGAAAGTGC 


3701 


TTTTTGG C TG 


AATGTAGTAA 


GCTGTGTGTG 


TGTGTGTGTG 


TGTGTGTGTG 


TGTGTGTGTG 


TGTGTGTGTG 


TG TGTAATC A 


GGATGCTCCA 


GGGTCAGCCC 


3801 


AGCTAAGATG 


CATATTGTAT 


AGAGCCTAAT 


AAAGTAC TAA 


CATAAAAAAA 


AATAGATAAG 


GCTGGGCATA 


ATGGTGGATA 


CCTCCCAGCA 


CTTGGGAGGC 


3901 


ACAAGCAGGA 


AGATTTTTGT 


GAGTTAGAGG 


AAAGCTCAGT 


C T AC AT ATTG 


AGTTCCAGGA 


TACCCAGGAC 


TACATTGGGA 


GATCCTTTCT 


CAAAAACAAA 


4001 


ACAAAATAAG 


AAAACAAAAA 


ACCCAAAACC 


AACCAACCAA 


CCAACCAACC 


AACCAACCCT 


ACCTAGATTA 


AAATATC TCA 


CATCTTTAAA 


TGCAAAGTAT 


4101 


TTTGACTCCG 


CTTCACAGAA 


AGAAAAAAAA 


AACACAGTCA 


ACATTAATAG 


CCTATTCTCT 


AGGTCTGGCT 


GTGTACAATC 


AACTTTTTAA 


TCTTCGGAAG 


4201 


ATTCCCAATT 


TATTTAGTAC 


TCTAATTATC 


GAGTTAGAGA 


TCATCGCACA 


GACCCTTCCA 


GATGGAATAT 


TCTCTGACTT 


AAATTTATTT 


GCCCCAAAAT 


4301 


ATCATCACAA 


TTTCTGTCCT 


TTACTAAGCT 


CCTACTCTAT 


GCTGGGCAGT 


TTTATCATCG 


AATCGTTCAA 


ATAAA TGT AT 


AAGAGGGACT 


TCAATAAGTC 


4401 


TTTATTTAAA 


CATAGCCAAT 


ACCTAAGTAA 


GAGTACAAAG 


CTTGTAGTCA 


AAAGAGAGCC 


TGGGACTGAG 


CCTAAGGGAC 


CCTTTTTATT 


TATTGGGCTG 


4501 


CTTAGTAAGG 


TTCTAAGAAC 


GTGTAGCTTC 


CCTTTTCTTA 


ATCTCTCCTG 


GCTATCGAAG 


AGCATGACTT 


GGCCCTCTAG 


TGGTGTTTGT 


TGTACCAGTA 


4601 


CTTAAGTGCT 


CATTCTGGTA 


ATCTTTTGCA 


GTTGTGCAAT 


GGCAATTCTG 


ATTGTATGAA 


CAGCGATGAT 


GCACTGTCAG 


AAAACAATCT 


GAAACTTCCA Ex3 


4701 


G AAA T AC AAA 


GAAATGATGG 


ATGCTTCCAA 


ACTGGATATA 


ACCAGGTAGG 


AACTTCTCAC 


ATTTACTTCC 


AGCAATTCTT 


CAGCCAATGT 


TCTCTCTTGC 



FiG. 5. Sequence of the rat IL6 gene. The sequence was determined as described under "Experimental 
Procedures." Exons are underlined and Labeled in the right margin. The TATA box is underscored and the positions 
of the minor and major mRNA cap sites are indicated by arrows, labeled m and M . The extended version of exon 
5 is shown- The polyadenylation signals used for the major and minor mRNA species are indicated by dashed lines 
above the s eq uence ai nucleotides 7 479 an d 8691. . 
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48 01 ATTGATACCT -3CATATATAC AGTCTCAGTC AGGTAACAAA GATCGGGAAG TACATCCAAC ACCACTTGGA TGTAACTGCC CTACACGAGC TTGCATACAA 

4 901 AAGGAGATTC AGAGGCACCT CAGTGGCTGC TAGGGTCAGG GAAAGTTTAC ATCACTCAAT CTACGTCCAT CACGAGCCAA ATGTTTCAAG TCTTTGTTCT 
5001 CCTCTTGCCT TTGAGGTCTT AGATCATTAA AAAAGATTTT TTTTCTCTAA AAAAATGAAA CAGTAGTTAT AAAAAATCTT ATCTAAATGT . CTTGAGCAAA 

S101 GTAACTTACA CCTTTAATTA AAAAGCAATT TCTTTGAACA GGTAAACTCC CACTACACTG TTAAATGTTT AGCAAGTTTA GCTAGTTCAT GTCTAGTACA 

S201 GTAGTTTATG AGCCATTCTC TTCTCTGGTT CCCCCTGGAA GGATTCAGGC CTACTCTCCC TACTGAGGAC ACTCTTCATA CCCCTCTCCT TCCAGCCAGT 

S301 GATATAATCT 3CATGGGTTT CTTATTTCTT AAAAGCAGGT CTCAGGTCAG CCCAGATAAG AACAAGAGTA CAGGAAGTGC ACTGGTGGGT AGAAGGCAAG 

54 01 GAGTCGGGTC AGAAGTAGAT TCTCACTATC ATACTCTGGG ACCCAGGGCG AAATATTAAG TGGTGCCTTT CTCTACTTGT CCAGATAATT AAAGGAGGTC 

5501 AGCTGATGCT GCCTATTGCC CAATTCTGTA GAAAATGCTG AACAGCAGGA ATCTTATTCT CAAATTGAGT CTATCTCCTA GAAAGAACTG ATATCTCCTT 

5601 CCATTTTACT TATAG GAAAT TTGCCTATTG AAAATCTGCT CTGGTCTTCT GCAGTTCCGT TTCTACCTGG AGTTTGTGAA GAACAACTTA CAAGATAACA Ex4 

5701 AGAAAGACAA AGCCAGAGTC ATTCAGAGCA ATA C TG AAA C CCTAGTTCAT ATCTTCAAAC AAGAG GTGAG TGCGTCCCCA TCTCTCATGC AGTGTAGGGA 

5B01 AGAGGTGCTA GGAGAAAACA CCCAGCATCT CAGGGTAGCA GCACTTTTTC CAGATAGCTG CTCAAAAGGG AGAGTCTGAG CAACACTTCT TTGACTGTAA 

5901 GGCCTTGCTT TGTTTTACTT TCCGGTTTTG TTTGGGCCTC TCTTTTGCAA AGACTGTCAA TACTTCCTTT AAACTGTATT AAATAGAATG TTATTAATTG 

6001 CTTAAGAGAT AAGAAAGCCT TAACAGACAA AAGGAAGCAT ATCCAACCTT ACACAGTCAT CCATTATGAG AACATAGAGC ACAAAAGTGA CTTTAATATT 

6101 TAATCCCTGG CAAGTGACAT TTTTGTTACC ACAGACCTAA TCCAGAGAAG TAGCCAAAAG CTAAAATGTC AGGAAAGGAA GGGATCAAGG GCTTCTGGCT 

6201 ACCACTAGCC ATTAAATAGT AGATATGGAG GACTCATGGC TCAGAGTGTG GATGAGTCAA GCCAGATGCG ATAAGAGGGC CTGGAATAGA ACCCTCATGC 

6 301 TAAGGCTGCT TCTGCCACTT GTAG TTT C TT GCCTCAAACC AGGGCATTTT CCAGTTTAAT GTGCACAGAA ACCACCCAGG GTCTTGTTCC AGCAGAGTCT 
6401 CACTTAGAAG GTCTAGCAAC GGAACTGAGA TTCATTTTTG TAAACAGCTG CTTGTTTATG GCCCTGCTAT TTTATTCAGG CTCCTAAGTC ATGGGGCAAG 
6501 CCTTCCTGTT AGTCCCCATG CCAGTGGGTC CACGGTAGAG TGTTTCCATC ATTCTTCATG CTACCACACC AAGAAACTCT CAACAAAGTC TGAATGGAAA 
6601 AGGTGAACGA ATCCATTTGA TTTAGATC CC TCTTCTCTTA TCTGGTAGCA ATACTGTATG TTAACATGGA TATTTGGATA ACAGATGAAT AAGATGGTCC 
67 01 TTTGGATGCT TATTCTCTAC TGGCAGACAG AACAGCAAAG TTAGGATTCT GTTGCTATTG GAAAACTAAT ACTAATCTCC TTGAAAAAAA AAGAATGGAG 
6801 TTCTTAAGCA TGGGTTTCTC TCACGCTAAG CTTGGAGCAA AAGTACCTAC CTCCTCTAAC TTGGGTGAGT CAAAGCAGAT TCACCTAGTT AATCTCATTC 
6901 ATTCTAAATT AGAACTTCCC ATAGCCCAGA GCACTCCACA AGGAAAAAAT TGTCCAATAC TTAACCAGTC TTTCTTTTTT CCTCCTTTAG ATAAAAGACT Ex5 
7001 CATATAAAAT AGTCCTTCCT ACCCCAACTT CCAATGCTCT C CTAA TGGAG AACTTAGAGT CACAGAAGGA GTGGCTAAGG ACCAAGACCA TCCAACTCAT 
7101 CTTGAAAOCA CTTGAACAAT TTCTAAAGGT CACTATGAGG TCTACTCGGC AAACCTAGTG TGCTATGCCT AAGCATATCA GTTTGTGGAC ATTCCTCACT 
7201 GTGGTCAGAA AATATATCCT GTCGATGGGT ATCTAAATTA TGTTGTTCTC TACGAAGAAC TGGCAATATG AATGTTGAAA CACTATTTTA ATTATTTTTA 

7 301 ATTTATTGAT AATTTAAATA AGTAAACTAT AAGTTAATTT ATGATTGATA TTTATACTTT TTATGAAG TG TCACTTGAAA TATTATCTTA TAG TTTTGAA 
7 401 AAGATAATAT AAAAATCTAT TTGATATGAA TATTCTCTTA CCTAGCCAGA TGGTTTCTTG CAATATATAA GTTTACCTCA ATGAATTGCT AATTTAAATT 
7 501 TTTTAAAAAA AAATCTTTGT GATGTATTTT TTAAATTCTG TTTTATTTAC ATTTTAAATG TTATCCCATT TCTCGATTTC CCATCCAGAA ACCTGATATC 

7 601 CCAACCTCAT CCCCTAGTCC CCCACCCTCT CACTCCCCCC TAACCCCCCC TCCGCTTCTA CCCACCCACT CCCTTCCACC TCTCCACCCT GACA TTTCCC 
77 01 TGTACTGGGG CATCGAGCCT TGACAGGACC AAAGGCCTCT CCTCCCATTG ATGTGGACAA GGCCATCCTC T GCTACATAT GCAGATGAAG CCATAGGTCC 
7801 ATCCCTCTCT TCTCTTGGGA TAGTGGTTTA GCTCTGGGAG CTCTGGGCTG TCTAGTTGGT TGATATTGTG TTCTTCTTAT GGGGTTG CAA ATCCTTTCAG 
7901 CTACTTCAGT CGTTTCTCTA ACTCCTCCAT TGGAGACCCT GCTCTCACTT CAATGGTTGG CTACGAGCAT CT GCCTTTGT ATTTGTAAAG CTTTGCAGAG 

8 001 CCTCTCAGGA GACAGCTATA TCAGGCTCCT GTCAGCATGC ACTTCTTTCA TCTGCGATAT TGTTTGGGTT TGGTGGCTGT ATATGGGATG AATCCCATGT 
8101 GGGGCAGTCT CTGGATGGCC TTTCAGTTTT TCAGTCTCTG TTCCACACTT TGTCTCTATA TTTCCTCCTG TGAGTATTTT TTTCCCCCTT CTAAG GTTTG 
8 201 ATGCATCCAC ATTTTGGTCT TCCTTCTTCT TGAGCTTCAT GTGGTTTGTG AATTGTGTTT TGGATACTAC AAGCTTTTGG GCTAATATCT GCTTACCAGT 

8 301 TACTGCATAC CATGTATGTT CTTTTGTGAT TGGGTTACTA TGGGACACTG AAAGACATAC TGGTTCCCAG TTGCGGAGGC TTGAATCCTG GAGACC CTGA 
8401 AGTTTCACCT GACTCCCAGC AAACCAGGGT GGTCACTAAC CCATAGCCCT CCATA GATTT GTGACTATCA GTCACATAAG GGCAACACCC CAAGCCCCTT 
8501 CACACGTAGA GGACACGACC ACAGGTCACA TAGGCTCAAG ACACATATCC GTTATAATGA GGTACATAAA GGCCTGGACT CATAGCCAAT GAATTT CCCT 
B601 TCCCAGACAC TCCTCCCTGC AAAAGGTATT TAATCTCAGG CCCACTCTCA GAAG TGGGA T ATGGTTTTAT TCA TCCACTT TCTGCCATGA CAATAAACAC 
B701 CTTGAAACCA TGGACTGTCT CTTTTCATCG GGATCTGACA TGGGGAGTTA CAAAGAAGGC CTTTGCCTAC AGTGCCACCA TCTAATCTCC CAAAGAAAGC 
8801 CTCCTGCGCT CTCAGCACCA GCAACTGCCA AGCTCAAGCA TACTGCTGAA CTGCTGA CCA GCCAAGGCTA CCCCAGTGGG ACCAGCTGCA GTGTCCCCTC 
8901 TTTCTCCCTC AGCACTGGGC TAGATCCACT TCCACAGCCC CCCACTCCAT TCTCAGCTCT TCCTCTGCAT CCTAGTGGTT CCCAGATGCC CAAGAGCCTG 

9 001 AG AA C T TG AT GACCCTGGCC TCCTTGCAGG CCCAGGCATG GTGAGTCAGC TCTGGCTATG TCTGGATCCC AGACTGTGCT TCCCCACCTC CTGGAACATT 
9101 CCCACAGCCT GACACCCCCA GCAGCAGCAT GGGGTAAGTG CAGTCAAACC TCCTTTATCC TCCCTCCCCT GCCTCCAGAA CCCTGTGGTG GAGCTGATTC 
9201 ACGGAGCCCC ACAATTAATT CTACAGGTTA TCTCACTGAG AATGATATCT TCCATCTATT TGCCTAAGAA TTTCATGAAG TCATTGTTTT TTTATTAGAT 
9 301 TTTTTATTTA CATTTCAAAT GTTATCCCCT TTCCCCCACC CAACCATCCA ACCCTTCCTG CCTCCCCACT CTGACATTCC CCTACATTGG ATGGGGCTCA 
9401 GCCTTGGCAG GATCAAGGCC TTCTCCTCCC ATTGGTGCCC AACAAGGCCA TCCTCACCTA CATATGCAGC TGGAGCCATA GGTCTGTCCA TGTGTACTCT 
9 501 TTGGATGATA GTTTAGTCAC TGGGAGCTCT GGTTCGTTGG TATTGTTGTT CTTACGGGTT GCAAGCCCCT TCAGCTTCTT CATTCCTTTC TCTAGCTCTG 
9601 CCATTGGGAA CCCCATTCTT AGTTCAATAG TTGTCTGTGA GCCTCTGTAC TTGTCATGCT CTGCAGAGCC TCTCAGCAAA CAGATATATC GGGCTCCTGT 
9 701 CAGCATGTAC TTTTTGCCAT CAGCAATATT GTCTGAATTT GGTGACTGTA TGTATATATG CTGGATCCCC ACGTGGAGCA GGCTGTGAAT CGCCATTCCT 
9801 TCAGCCTCTG CTC CAAACTT TGTCTCCATA TATACTCCTA TGAATATTTT TGTTCCCCCA TCTAAGAGGA CTGAACCATC CACACTTTAG TCATCCTTCT 
9901 TCTTGAACTT CATCTGGTCT GTGGATTGTA TCTTGGGTAA TCTGAGCTTT TGGGCTATTT TCCACTTATC AGTGAGTTCA TACCATCTGT GCTTTTTTGT 

10001 GATTGGGTTA CCTTGCTCAG GATGATATTT TCTACTTTCA TCCACTTGCC TATGAATTTC ATGAATT 
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Fig. 6. Mapping the 5' ends of IL6 mRNA- A. mapping by RNase protection. Internally labeled RNA copies 
of a genomic DNA fragment containing exon 1 were produced by subcloning the fragment into a bluescript vector 
and copying each strand separately into a single-stranded RNA (see "Experimental Procedures"). Track J, the 
probe complementary to mRNA, and track 2, the probe of the same strandedness as mRNA were annealed with 
non- radioactive mRNA from LPS-stimulated RM-SVl cells. The reaction products were treated with a mixture of 
pancreatic RNase and RNase Tl, and RNase- resistant fragments were analyzed by electrophoresis on a sequencing 
gel. The two resistant bands labeled by arrows define the map positions of the minor and major start sites ( m and 
M in Fig. 5). The size marker used was radioactively end-labeled Xfspl -digested pBR322 DNA fragments, previously 
calibrated against RNA markers. B, mapping by RNase H experiment. Oligonucleotides 1 and 2. both complemen- 
tary to mRNA, were synthesized based on the cDNA sequence. Oligonucleotide 1 was annealed with RNA extracted 
from LPS-stimulated RM-SVl cells and from inflamed rat livers, and the reaction product was treated with RNase 
generating RNase H-resistant fragments as represented schematically. The resistant fragments were electro- 
phoresed in a polyacrylamide sequencing gel and transferred to a nylon membrane. The blot was hybridized with 
radiolabeled oligonucleotide 2 to reveal the size of the resistant fragments. Tracts 1 and 2, 4-day exposure of the 
autoradio graph; tracts 3 and 4, 1-day exposure. Reactions analyzed in tracks I and 3 were performed with 5 jig of 
poly(A)'* mRNA from LPS-stimulated RM-SVl cells, reactions in tracks 2 and 4 with 10 jig of poly(A)^ mRNA 
from rat liver, 18 h after induction of an experimental acute-phase reaction by intraperitoneal injection of complete 
Freund's adjuvant, as described (29). The sizes of the hybridizing fragments at 79 and 83 bp define the positions 
of the minor and major cap sites (m and M in Fig. 5). The length of the major mRNA species is calculated to be 
1048 bp, without the poly (A) tail, as shown in the bottom line (mRNA M ). The minor mRNA species is four 
nucleotides longer and not shown for simplicity. 
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indicator cell line, calibrated with known standards of recom- 
binant human IL6, we determined that RM-SVl cells accu- 
mulate 110 IL6/BSF-2 units/ml of culture medium in 20 h t 
and that treatment with 10 ug/ml of LPS leads to an 8-fold 
stimulation in 20 h (Table I). Mouse LTK and HeLa cells 
transfected with neomycin-resistant vector only produce low 
background levels of an endogenous IL6-like activity that 
score in this assay with 10 IL6/BSF-2 units/ml and 18.5 
units/ml, respectively, accumulated in 20 h. The line MLTK- 
RIL6.92 accumulates 9-fold increased IL6/HSF amounts in 
the culture medium, and HeLa-RIL6.92 cells accumulate 9.4- 
fold increased levels in the same time period. Thus, the 
transfected cell lines accumulate significantly increased levels 
of IL6 in their culture medium, and we conclude that the 
cDNA clone pRIL6C.92 codes for biologically active rat IL6, 
at least for the HSF activity of IL6, designated IL6/HSF. 

Isolation and Sequence Analysis of the Rat IIJ> Gene — A set 
of three genomic DNA clones contained in phage X vectors 
were isolated from a rat genomic library by screening with 
the cDNA clone pRIL6C91 (see "Experimental Procedures"). 
A restriction enzyme cleavage map for the enzyme £coRI was 
produced, and two contiguous Eco RI restriction fragments of 
clone ARIL5G-9 (a 3.4-kb fragment and a 6.66-kb fragment) 
were r andomly su bcloned into M13 phage vectors and se- 
quenced. The resulting 10.067-kb sequence (Fig. 5) contains 



the entire 5.8-kb rat IL6 gene as well as approximately 2.9 kb 
of 5'- and 1.2 kb of 3 '-flanking sequence. The exon/intron 
boundaries were located by comparison of the cDNA and gene 
sequences. The 5' end of the mRNA was mapped both by 
RNase protection and by RNase H experiments (Fig. 6). The 
result of both experiments demonstrated the existence of two 
mRNA size classes, a minor abundance class starting at 
nucleotide 2903 in the genomic sequence {labeled m in Fig. 5) 
and a major abundance class starting at nucleotide 2907 
{labeled M m Fig. 5). 

The 2.4-kb mRNA Species Contains Additional Sequences 
in the 3' -Nontranslated Region — The experiments performed 
to map the 5' ends of the RM-SVl cell-derived IL6 mRNA 
did not provide evidence for additional transcription start 
sites located further upstream in the gene. Therefore, the 2.4- 
* kb mRNA species could either be derived from a second gene, 
or contain additional intron sequences or additional sequences 
from the region located to the 3' side of the polyadenylation 
site used for the 1.2-kb mRNA species (nucleotide 7505 in 
Fig. 5). To test for additional intron sequences, intron-derived 
oligonucleotide probes were synthesized. Northern blot hy- 
bridization with intron probes was performed, but the 2.4-kb 
mRNA species failed to react. 4 However, the 2.4-kb mRNA 
species selectively hybridized with a 1.22-kb SacI/BamHI- 

* \V. Northerns nn, unpublished results. 
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Fic;. 7, The 2.4-kb minor mKNA species is extended at the 
3' end. RM-SVl cells were stimulated for 18 h with 10 figfra\ of 
bacteria] LPS, and 10-yg aliquots of total RNA were separated by 
electrophoresis. After transfer to nylon membrane (GeneScreen) the 
filter was hybridized either to radioactively labeled rat IL6 cDNA 
(track 1) or to a 1.22-kb Sac I /Bam HI -genomic DNA fragment (track 
2) located 3' of the polyadenylation site of the major mRNA species 
(at 7505 in Fig. 5) in the gene. Hybridization in the high motecuIaT 
weight range in track 2 is probably due to LI repetitive sequence 
elements contained in the radiolabeled genomic DNA probe fragment. 

genomic DNA fragment derived from the gene region down- 
stream of position 7505 (Fig. 7). Therefore, the 2.4-kb mRNA 
species must contain additional sequences that are not present 
in the 1.2-1.3 kb species and which are derived from the gene 
region downstream of nucleotide 7505 in Fig. 5. To establish 
this fact, a second cDNA library of greater complexity than 
the first was produced from poly(A) + mRNA isolated from 
LPS-stimulated RM-SVl cells. This library was screened with 
the cDNA clone pRIL6C.91, and positive isolates were coun- 
terscreened with the 1.22-kb Sacl/BamHl -genomic DNA 
fragment, representing genomic sequences located to the 3' 
side of nucleotide 7505. Several isolates hybridizing with both 
probes were obtained. The isolate with the longest; insert was 
sequenced (data not shown). Its sequence was identical with 
the genomic sequence,, and the polyadenylation site for this 
mRNA species is located at nucleotide 8708 in Fig. 5. Thus, 
the £.4-kb mRNA species contains in its 3 '-non translated 
region an additional segment of 1203 contiguous nucleotides 
beyond those present in the 1.2-1. 3 -kb mRNA species. The 
additional sequences are derived from the 3'-terminal region 
of the gene immediately downstream of nucleotide 7505 in 
Fig. 5. 

The Rat I US Gene fs a Single Copy Gene — To determine 
whether both mRNA species are derived from a single gene 
or from two different IL6 genes, we counted the number of 
rat IL6 genes. Southern blot analysis (Fig. 8) revealed only- 
one IL6 gene in the rat genome. Thus, the rat IL6 gene is a 
single Copy gene, and the 2.4-kb mRNA must; be derived from 
the same gene locus as the 1.2-kb species by use of alternative 
polyadenylation. Therefore, exon 5 of the gene exists in two 
sizes, corresponding to the o'-nontranslated regions of both 
mRNA species (Fig. 9). Sequence analysis of the genomic 
region selectively present in the long version of exon 5 re- 
vealed the pres ence of a re p etitive g enom ic seq uenc e element 
of the Ll class, thai is approximately 800 bp long and 72 




Genomic DNA Genomic Clone 

Fig. 8. The rat IL6 gene is a single copy gene. Rat genomic 
DNA [left) and DNA of the genomic clone XR1.LGG.5 (see Fig. 9, 
right) were digested with EcoRl [lane 1), Win dill itane 2), Sacl (lane 
3), and BamHl (lane 4). Twenty ng of the genomic DNA digest and 
1 /ig of the phage DNA digest were loaded in each lane and separated 
by electroyjhoresis in a 0.9% agarose gel. The DNA was transferred 
to a nylon filter (GeneScreen) and hybridized with the 1L6 cDNA 
clone pRIL6C.91 in 6 x SSC at 63 X- for 36 h, as described (39). The 
cDNA probe was radioactively labeled by nick translation using the 
random primer method (Random Priming Kit. Amersharn Corp.), 
DNA sizes were evaluated with reference to size markers that were 
carried in the same gels inot shown). Fragment sizes in kilobases are 
indicated in the left margin. Equivalent fragments in both gels are 
connected by broken lines. 

80% identical with the reported rat and murine Ll consensus 
sequences, respectively (46-48). 

DISCUSSION 

Microheterogeneity in mRNA length for human IL6 was 
discovered by sequence analysis of variant; cDNA clones dif- 
fering at their 3' ends/' The variant species do not differ 
sufficiently in size and/or are not of sufficient abundance to 
be detectable as separate bands by Northern blot analysis. 
Human and murine IL6 mRNA appears, as far as previously 
published, as a single band in the 1.2-1.3 kb size range on 
Northern blots. The presence of a separate 2.4-kb mRNA 
species 20% as abundant as the major species is a unique 
property of rat macrophages (Fig. 1) and of other rat cell 
types producing IL6." 

The cDNA^ sequence coding for^ the. mature portion of rat 
1L6 is 92% identical with the corresponding murine sequence 
andj589fa _ ide ; ntical with, the, corresponding human sequence, 
while the .degrees of amino acid sequence identity are 93 and 
58%, Respectively. Thus, the two rodent IL6 species are highly 
conserved, but the degree of relatedness between the human 
and rodent IL6 sequences is surprisingly low given the fact 
that recombinant human IL6 acts on rat liver cells and must 
therefore be recognized by the rat hepatocystic IL6 receptor. 
This is surprising because other cytokines such as G-CSF (9) 
do not act. across species boundaries, although their degree of 
inter-species sequence conservation is higher. One might ex- 

" W\ Northemann. J. Ganldie. and G. H. Fey. unpublished results. 
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Fig. 9. Exon/intron block struc- 
ture of the rat IL6 gene. The inserts 
of the three overlapping genomic phages 
ARIL6G.3, XRIL6G.5, and ARIL6G.9 are 
shown as open boxes. The insert lengths 
in kilobasea are given underneath. Eco RI 
restriction sites are labeled (E). The 
transcribed portion of the gene is shown 
in black. Bottom line, expanded restric- 
tion map of the 10.06-kb sequenced por- 
tion of the gene (two contiguous EcoKl 
fragments), showing additional sites for 
the enzymes Hindlll {H) t BamHl (B), 
Sac I {S), and Bstxl {Bx). Exons are 
given as black boxes; the extended form 
of exon 5, created by alternative poly- 
adenylation, is shown as a shaded box. 
Exon and intron lengths in kilobases are 
given underneath. 
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pect to find the receptor-binding domain of IL6 contained in 
a part of the molecule that is conserved among the three 
species. 

The assignment of the NH 2 -terminal amino acid of rat IL6 
(Fig. 3) is not based on analysis of purified mature rat IL6 
but on comparison of the cDNA derived sequence with murine 
IL6. Mature murine IL6 occurs with two different Non- 
terminal residues, resulting from cleavage of the precursor 
polypeptide by signal -peptidases at two adjacent sites (43, 44). 
Since the rat IL6 protein sequence is identical in this region 
with murine IL6, we predict mature rat IL6 may similarly 
occur with two possible NH 2 -terminal residues, provided the 
rat signal peptidases have similar sequence specificities as 
their murine counterparts. 

Human IL6 has been reported to be both N-glycosylated 
and O-glycosyiated, while the murine IL6 sequence lacks 
potential N-glycosylation sites, and N-gly cosy lat ion has not 
been detected for murine IL6 (32, 33). Rat IL6 is similar to 
murine IL6: the cDNA sequence predicts no N-glycosylation 
sites. Direct studies of glycosylation of the rat protein have 
not yet been reported. It is possible that rat IL6 carries O- 
linked carbohydrates. It will be interesting to investigate 
whether any one of the multiple functional properties of 
human IL6 are specifically dependent on N-glycosylation and 
whether rat and murine IL6 lack those properties. 

The number of cysteinyl residues is conserved between 
murine and human IL6, and therefore, presumably both mol- 
ecules have the same disulfide bond connectivity. The same 4 
cysteinyl residues are also found in rat IL6, plus an additional 
fifth cysteinyl residue. It is to be expected that the 4 conserved 
rat cysteines will be engaged in the same disulfide- bond 
pattern as in murine and human IL6, and therefore the fifth 
rat cystein is expected to be unpaired. This would endow rat 
IL6 with the possibility to engage into an intermolecular 
disulfide bond or to block this reactive group by other second- 
ary modifications. In the future it will be interesting to 
investigate whether rat IL6 has an increased propensity over 
human and murine IL6 to form dimers or aggregates with 
other molecules by virtue of this additional cysteinyl residue 
or whether it displays other functional particularities due to 
secondary modifications of this residue. 

The cell line RM-SVl is a convenient source of rat IL6 
mRNA of constant quality. The line is believed to have 
originated by transformation of a macrophage with SV40, 
_although_we_cannot-formally-exclude^the_possibility._that_it. 



was generated by transformation of a monocyte or another 
cell type present in adherent primary PEC cultures. However, 
since macrophages and monocytes are the most abundant cell 
types among adherent PECs, and since the line has a macro- 
phage- like morphology with ruffled edges and presents other 
' traits characteristic of macrophages, 4 we refer to it as a 
macrophage-derived cell line. A characteristic property of 
primary peritoneal macrophages is their failure to divide in 
culture. The line RM-SVl grows indefinitely, and therefore 
clearly differs from primary macrophages. It shares with 
primary macrophages the important property of being induc- 
ible by LPS. The line produces residual levels of ILl and IL6 
mRNAs and of IL6/HSF activity, and both mRNAs and the 
IL6/HSF activity are clearly inducible by treatment with LPS 
(Fig. 4 and Table I). 

The cDNA clone pRIL6C.92 clearly codes for biologically 
active rat IL6 because both human and murine cell lines 
transfected with these rat cDNA sequences show an approx- 
imately 9-fold increase in secreted IL6/HSF activity over 
background. The levels of IL6 activity secreted by the trans- 
fected lines are in the range of those produced by nonstimu- 
lated macrophages (100-200 IL6/BSF-2 units/ml accumu- 
lated/20 h) and about 2 -fold less than the levels found in a 
strain of primary human lung fibroblasts (Ven 8 strain), which 
accumulate 200-300 IL6/BSF-2 units/ml in 20 h. 6 

The exon/intron structure of the rat IL6 gene (Fig. 9) 
closely resembles those of the murine and human IL6 genes, 
as well as that of the human G-CSF gene (49, 50). The 
positions of exon/intron boundaries and the exon lengths are 
conserved in the protein coding portion of the gene, but the 
genes differ at the 5' boundary of exon 1 and the 3' boundary 
of exon 5, which are located outside the coding part. For the 
human IL6 gene, three cap sites have been reported (Cl to 
C3), with the closest site (Cl) located 63 to 64 nucleotides 5' 
of the translation start triplet, the second site (C2) at —86 to 
—88 and the third site (C3) at —176, and all preceded by 
conventional TATA-box sequences (49). In the rat gene only 
two cap sites were detected (m and M in Fig. 5), separated by 
only four nucleotides, located 67-71 bp upstream of the trans- 
lation-initiation triplet, and preceded by a conventional 
TATA-box sequence. No sites located further upstream were 
detected by using mRNA prepared from the rat macrophage- 
derived cell line RM-SVl. We cannot exclude the existence 
of additional potential sites of transcription initiation for the 

!!.M..Hattori and J._Gauldie, unpublished.results.__ 
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Fig. 10. Comparison of polyadenylation sites used in the 
rat, murine, and human IL6 genes. The AATGAA polyadenyl- 
ation signal used for the major 1.2-1.3-kb rat IL6 mRNA species and 
the corresponding polyadenylation site {A) n with the distance be- 
tween both in base pairs (6) are given in the top row at the left. The 
position of the inserted LI repetitive sequence element (730 bp) is 
indicated. The canonical AATAAA polyadenylation signal used for 
the minor 2.4 -kb mRNA species, the distance in base pairs (6), and 
the location of the polyadenylation site (A ) n are shown. The distance 
between the two polyadenylation sites (1212 bp) is indicated below. 
The mouse genome contains two potential polyadenylation signals, 
but as far as it is known only the canonical signal AATAAA is used. 
The human genome contains a very similar arrangement as the mouse 
genome with two sites at a distance of 78 bp, and both are used. 

rat IL6 gene that may only be used in cell types other than 
macrophages. Differential utilization of the three start sites 
of the human gene was reported in different cell types ex- 
pressing human IL6 (49). 

The presence of two different mRNA species differing by 
1.2 kb in their 3'-nontranslated regions and generated by 
alternative polyadenylation is a unique property of the rat 
IL6 gene. The longer version of exon 5 of the rat IL6 gene 
contains a sequence of the Ll repetitive sequence family, 
located between nucleotides 7537 and 8337 in Fig. 5. This 
sequence is approximately 72 and 80% identical with the 
reported consensus rat and murine Ll sequence elements, 
respectively (46-48). Thus, the rat gene differs from its human 
and murine counterparts either by the insertion of this se- 
quence element into the rat gene after speciation, or by the 
loss of this sequence from both the human and murine genes 
after speciation. The murine and human genes have highly 
conserved sequences in this area extending until a position 
equivalent to nucleotide 7536 in the rat gene (Figs. 5 and 10), 
and an independently occurring loss at the same position in 
both species is highly improbable. Therefore, it is almost 
certain that these sequences were inserted into the rat gene 
after speciation. While Ll sequence elements are not uncom- 
mon and frequently recornbine, there are few instances, in 
which they have been reported to become part of a transcrip- 
tion unit and to remain present in a stable mRNA species 
(48). In this respect the rat IL6 gene represents a so far unique 
case. 

We refer to the polyadenylation signal as the hexanucleo- 
tide sequence that is a variant of the canonical sequence 
AATAAA and that precedes the site of attachment of the 
poly (A) tail (the polyadenylation site) usually by approxi- 
mately 13-20 nucleotides. The major 1.2-1.3 kb rat IL6 
mRNA species is produced by using the polyadenylation sig- 
nal AATGAA, located 20 bp upstream of the major polyade- 
nylation site at position 7505 (Figs. 5 and 10). The minor rat 
IL6 mRNA species contains an additional 1203 nucleotides 
in its 3'-nontranslated region and is generated by using the 
canonical polyadenylation signal AATAAA at position 8692 
-and-the polyadenylation-site at position 8708-(FigSr-5-and 10), - 



Two potential polyadenylation signals and polyadenylation 
sites have also been detected by sequence analysis of variant 
human IL6 cDNA clones. While the clones published by 
Yasukawa and colleagues (49) are generated by using the site 
corresponding to the minor site in the rat gene (Fig. 10), 
clones generated by utilization of the more 5' -located site, 
corresponding to the major polyadenylation site of the rat 
gene, have also been found. 5 The published cDNA clones of 
murine IL6 utilize the more 3' -located polyadenylation site 
(Fig. 10). However, at the same distance as in the human 
gene, 78 bp upstream of this site, the same polyadenylation 
signal AATGAA is found in the murine and rat IL6 genes. 
We do not know whether this signal is also used for the 
generation of a second murine IL6 mRNA species. It is also 
unknown whether the mRNA species generated by different 
polyadenylation have different structural or functional prop- 
erties (mRNA folding, mRNA half- life, transport, translation 
efficiency, etc.). 

With the availability of the rat IL6 genomic clones, the 2.9 
kb of 5 '-flanking region sequence and a novel rat macrophage - 
like cell line with inducible expression of the IL6 gene, the 
tools are now in hand for a further investigation of the signals 
and molecular mechanisms leading to the induction of this 
gene. This induction is a key event in the early stages of an 
inflammatory response and is possibly also involved in the 
process of growth transformation. 
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SUMMARY Using a reverse transcriptase-polymerase chain reaction method, a cDNA for the 
complete coding region and part of the 5 '-untranslated region of rat IL-13 was cloned from rat 
renal cortex RNA following the induction of anti-glomerular basement membrane antibody- 
induced glomerulonephritis. The coding region of IL-13 cDNA displays 74% and 87% sequence 
identity with the coding regions of human and mouse IL-13 cDNA, respectively. The deduced 
amino acid sequence of rat IL-13 reveals 63% and 79% homology with the human and mouse 
proteins, respectively. Using the rat IL-13 cDNA as a molecular probe, increased production 
of IL-13 mRNA in renal and splenic tissue is demonstrated in the first 48 hours of antibody- 
induced glomerulonephritis in the rat. The data suggest a role for IL-13 in the inflammatory 

response in ViVO. « 1993 Academic Press, Inc. 



Interleukin-13 (IL-13) is a recently discovered lymphokine produced by the T H 2 subset of helper 

T cells (1,2). The genes for human and murine IL-13 have been cloned and characterized (1-4). 

Human and murine IL-13 proteins display approximately 60% amino acid sequence identity but 

lack significant homology with other lymphokines (1,2). Recombinant IL-13 stimulates the 

proliferation of anti-IgM or anti-CD40 activated human B lymphocytes and induces IL-4- 

independent IgG4 and IgE synthesis (2,5,6). It also induces the differentiation of human 

monocytes into dendritic-like cells and upregulates the surface expression of MHC class II 

antigens and the low-affinity receptor for IgE (CD23 or FceRII) on these cells (2). Like IL-4 

and IL-10, IL-13 displays anti-inflammatory actions on macrophages (1). It inhibits IL-10, IL-6, 

IL-8, and TNFo; mRNA production in lipopolysaccharide-treated monocytes (1). 

Anti-glomerular basement membrane (anti-GBM) antibody-induced glomerulonephritis 

in the rat is an inflammatory renal disease characterized by infiltration of the kidneys with 

|To whom correspondence should be addressed at VAMC, Research-151, 1670 Clairmont 
Road, Decatur, GA 30033. FAX (404)728-4851. 

Abbreviations: CD, cluster of differentiation; GBM, glomerular basement membrane; IFN, 
Interferon; EL, Interleukin; RT-PCR, reverse transcriptase - polymerase chain reaction; T H , 
helper T-lymphocyte; TNF, Tumor Necrosis Factor; UDG, Uracil DNA Glycosylase. 
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neutrophils, T-lymphocytes, and macrophages (7). In this study, we report the isolation of a rat 
IL-13 cDNA clone and its application as a molecular probe to examine IL-13 gene expression 
during the inflammatory response in vivo. We demonstrate an increase in IL-13 mRNA in rat 
glomeruli between 2 and 48 hours following the induction of glomerulonephritis. Based on its 
in vitro actions on macrophages, IL-13 may play a modulatory role in inflammatory renal 
disease. 



MATERIALS AND METHODS 

Induction of Glomerulonephritis. Male Sprague-Dawley rats (200-250 gms) (Charles River, 
MA) were injected in the femoral vein with 150 fxl of rabbit anti-rat GBM serum (kind gift of 
George F. Schreiner, CV Therapeutics, Palo Alto, CA). Control animals were injected with an 
equal volume of normal rabbit serum. Rats were sacrificed at 2, 16, 24 and 48 hours post- 
injection (n=4 per time point). Kidneys were flushed with ice-cold Hanks Balanced Salt 
Solution (HBSS) until blanched and removed surgically. Glomeruli were isolated on ice by 
graded sieving (8) and used immediately for RNA extraction. Disease induction was confirmed 
by histological examination of the renal tissue and by measuring urinary protein to creatinine 
ratio at time of sacrifice. 



Isolation of RatH^13 cDNA Clones. A RT-PCR cloning approach was utilized to isolate a 
cDNA encoding rat IL-13. Total cellular RNA was extracted from the cortex of 
glomerulonephritic kidneys by the Chomczynski method using RNAzol B reagent (Biotecx 
Laboratories, TX) (9). RNA was reverse transcribed using an oligo-dT primer and Moloney 
Murine Leukemia Virus reverse transcriptase (Perkin Elmer, CT). Two rounds of PCR 
amplification were performed using nested pairs of oligonucleotide primers (Table I). Both 
primer pairs were selected based on areas of high homology in the untranslated regions (UTR) 
of human and mouse IL-13 cDNA (1,4). Two pg of reverse transcribed RNA was amplified 
under the following conditions: lOmM Tris HC1 (pH8.3), 50mM KC1, ImM MgCl 2 , 5% 
Dimethyl Sulfoxide (DMSO) (Sigma, MO), 0.2mM of each dNTP, 500 nM of each primer (pair 
A), 2.5 units of Thermus aquaticus DNA polymerase (Taq polymerase, Perkin-Elmer, CT), 
denaturation at 94 °C for 1 min, annealing at 42 °C for 2 min, extension at 72 °C for 2 min for 
40 cycles followed by extension at 72°C for 5 minutes using a DNA Thermal Cycler 480 



TABLE I. PCR PRIMERS UTILIZED IN THE ISOLATION OF RAT IL-13 cDNA 



Primer pair 1 : Sense 

And- sense 

Primer pair 2 : * Sense 
(nested) Anti-sense 



5 '-gacaagccagcagccta-3 ' 
5 '-ggcaggcagtgcaggctgagg-3 

5 '-tggctctgggcttcatggc-3 ' 
5 '-tgtgagaaaggaaaatga-3 ' 



*In_order_to-(done_the_RT^RCR.-produ^ include 

deoxy-UMP residues at the 5 ' end as follows: 5 '-caucaucaucau-3 '. 
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(Perkin-Elmer, CT) . 5/d of the above PCR product (total volume = lOOfd) were removed and 
re-amplified under the same conditions using nested primer pair B (Table I & Figure 1 . 
Amplified products were electrophoresed in 3% SeaKem LE agarose (American Bioanalytical, 
MA). Expected target DNA (-470 bp) was excised from the gel and punfied by 
phenol/chloroform extraction and ethanol precipitation according to standard techniques (10) and 
cloned into pAMPl plasmid utilizing Uracil DNA Glycosylase according to instructions provided 
with the CloneAmp R System (Gibco BRL, MD)(11). Following transformation of E coli DH5a 
fBRL MD), separate colonies were picked, grown overnight, and plasmid DNA punfied 
(Oiagen CA). Sequencing of both strands of the cloned PCR product was performed by the 
Sanger method (12) using oligonucleotide primers that anneal to. the SP6 and T7 promoter 
regions of pAMPl (Gibco BRL, MD). 



Analysis of IL-13 Gene Expression. Based on the sequence of the isolated rat IL-13 cDNA 
a species specific PCR primer pair which amplifies a 279 bp segment of rat IL-13 was designed 
using PCRPLAN software (pcgene, Switzerland). The sequences for the sense and antisense 
primers were 5'-cagggagcttatcgaggagc-3 ' and 5'-aagttgcttggagtaattgagc-3 ' respectively^ Two 
ug of total RNA extracted from glomeruli at 0, 2, 16, 24, and 48 hours following the induction 
of glomerulonephritis was reverse transcribed as described above and PCR amplified under the 
following conditions: lOmM Tris.HCl (pH8.3), 50mM KCl, -0.6mM MgCl, 5% DMSO, 
denaturation at 94°C for 1 min, annealing at 55°C for 45 sec, extension at 72 C for 45 sec for 
40 cycles. PCR products were electrophoresed on 2% agarose gels. 

Southern Blotting. PCR product electrophoresed as described above was transferred to 
Hybond-N+ nylon membrane (Amersham, IL) by capillary Southern blotting (10) and covalently 
linked by UV-irradiation using a Stratalinker UV crosslinker (Stratagene, CA). Rat IL-13 cDNA 
was purified from the plasmid pAMPl-IL13 following digestion with EcoRI and BamHl (New 
England Biolabs, MA). The cDNA was denatured and labeled with the enzyme horseradish 
peroxidase using the Enhanced Chemiluminescence (ECL) direct nucleic acid labelling and 
detection system (Amersham, USA). The membrane was hybridized overnight with the labeled 
cDNA probe in 0.5M NaCl solution at 42°C. Following hybridization, the membrane was 
washed in 0.5xSSC and detection reagents were added as described by the manufacturer 
(Amersham, IL). Autoradiography was performed at room temperature for 1 minute. 



RESULTS AND DISCUSSION 

Rat IL-13 cDNA cloning. The RT-PCR cloning strategy utilized in this study resulted in the 
isolation of a single rat IL-13 cDNA (Fig. 1). The nucleic acid sequence of the RT-PCR 
product cloned into pAMPl was identical in six different clones that were sequenced. The 
longest open reading frame encodes a 131 amino acid polypeptide and displays 74% and 87 & 
nucleic acid sequence identity with the coding regions of hIL-13 and mIL-13, respectively. 
Deduced amino acid sequence of rat IL-13 reveals 63% and 79% homology with the human and 
mouse proteins, respectively (Fig. 2). The probable signal peptide cleavage is predicted to occur 
after amino acid 20 resulting in a mature polypeptide of 111 amino acids. The predicted 
molecular weight of the mature protehTis T2,T26-DaT^ Airfivercysteine- residuesT-including-one- 
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10 20 30 40 50 

I I I I I 

! TGGCTCTGGGCTTCATGGC ACTCTGGGTGACTGCAGTCCTGGCTCTCGCT 

5 1 TGCCTTGGTGGTCTTGCCACCCCAGGGCCAGTGCGGAGATCCACATCTCC 

101 CCCTGTGGCCCTC AGGG AGCTTATCG AGGAGCTGAGCAACATCACACAAG 

151 ACCAGAAGACTTCCCTGTGCAACAGCAGCATGGTATGGAGCGTGGACCTG 

2 01 ACAGCTGGCGGGTTCTGTGCAGCCCTGGAATCCCTGACCAACATCTCCAG 
251 TTGCAATGCCATCCACAGGACCCAGAGGATATTGAATGGCCTCTGTAACC 

3 01 AAAAGGCCTCGGATGTGGCTTCCAGCCCCCCAGATACCAAAATCGAAGTA 

3 51 GCCCAGTTTATATCAAAACTGCTCAATTACTCCAAGCAACTTTTCCGCTA 

4 01 TGGCCACTGAGGGGAGACCGGCCCTGGACATCTCAGCTGTGGACCTCAGT 
4 51 TGTGGAT TCATTTTCCTTTCTCACA 

FIGURE 1 . Nucleotide sequence of rat IL-13 cDNA. The initiation and stop codons of the 
longest open reading frame are double underlined. Underlined nucleotides correspond to mouse 
oligonucleotide primers used to amplify the rat IL-13 cDNA (primer pair 2, Table I). 



in the putative signal sequence, are conserved in human, rat, and mouse IL-13. Rat IL-13 
contains four potential N-linked glycosylation sites, three of which are conserved in human and 
mouse IL 13. Rat IL-13 does not share any significant amino acid identity with rat IL-4 (13). 

The cross-species RT-PCR cloning strategy utilized in this study allows high efficiency 
isolation of target cDNA. Performing a second round of PCR using a nested primer pair made 
possible the cloning of rat IL-13 cDNA from non-lymphatic tissue which contains relatively few 



M0USEIL13 - 



RATIL13 



HUMANIL13 - 



MALVWTAVT^J^CLGGLAAPGPVPRSVSLPLTL 

I I I I II I I I I I I I I ! I I I I I I I Mill IIIIIIIMIIII I 

MALWVTAVLALACLGGLATPGPVRRSTSPPVALRELIEELSNITQDQKTS 

IN IIIIIIIMIIII II MINIMI till II 

MALLLTTVIALTCLGGFASPGPVPPST ALRELIEELVNITQNQKAP 



-49 

-50 
-46 



M0USEIL13 - 



LCNGSMVWSVDLAAGGFCVALDSLTNISNCNAIYRTQRILHGLCNRKAPT -99 

Ill MINIM Mill 1 1 MINI INI IMMI INI II 

RAT3X13 - LCNSSMVWSVDLTAGGFCAALESLTNISSCNAIHRTQRILNGLCNQKASD -100 

— Ill INN INI II III I I II I II III I I I II 

HUMANIL13 - LCNGSMVWSINLTAGMYCAALESLINVSGCSALEKTQRMLSGFCPHKVSA -96 



MOUSEIL13 - TVSSLP DTKIEVAHFITKLLSYTKQLFRHGPF -131 

J I I III MM I N I III I 

RATIL13 - VASSPP DTKIEVAQFISKLLNYSKQLFRYGH -131 



HUMAN IL13 - GQFSSLKVRDTKIEVAQFVKDLLLHLKKLFREGRFN -13 2 

FIGURE 2. Homolo gy alignment of the deduced amino acid sequences of mcjuse, rat, and 



human IL-13 proteins. Similarities between two or three species are indicated (1 
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lymphocytes. In fact, we isolated rat IL-13 cDNA from renal cortical tissue following the 
induction of glomerulonephritis during which only a small number of lymphocytes infiltrate the 
kidney (7). 

Analysis of IL-13 gene expression in vivo. Injection of antibodies directed to the GBM of the 
rat kidney induces an inflammatory response characterized by infiltration of the glomerulus by 
neutrophils, T-lymphocytes, and monocytes (7). The in vitro actions of IL-13 on macrophages 
suggest that this T-lymphocyte-derived lymphokine may modulate macrophage function in vivo. 
In order to determine whether IL-13 is expressed during the course of inflammatory disease, we 
isolated rat glomeruli following the induction of anti-GBM glomerulonephritis and studied IL-13 
mRNA production by RT-PCR. The primer pair utilized is species specific and is based on the 
rat IL-13 cDNA sequence (Figure 1). As shown in Figure 3A, IL-13 mRNA increases 
significantly 2 hours following the induction of glomerulonephritis and can still be detected at 
48 hours. Control rats injected with normal rabbit serum did not reveal increased glomerular 
expression of IL-13 over baseline, and GAPDH mRNA expression analysed by RT-PCR was 
constant over time (data not shown). IL-13 mRNA is also detected in rat spleen following 
injection of anti-GBM antibody (Fig. 3B). The identity of the PCR product was confirmed by 
Southern blotting and probing with rat IL-13 cDNA (Fig. 3C). 

The data demonstrates expression of the IL-13 gene during an antibody-mediated 
inflammatory disease. We have previously observed transient T H 2-lymphokine (IL-4 and IL-10) 
gene expression in rat glomeruli in the first 24 hours of anti-GBM nephritis (14). As predicted 



TIME (hrs) 



0 2 16 24 48 



B 




TIME (hrs) 
0 2 16 24 48 



<. TIME (hrs) 
0 2 16 24 48 





FIGURE 3 Analysis of rat IL-13 mRNA production in rat glomeruli (A) and spleen (B) 
following the induction of anti-GBM glomerulonephritis. 20^1 of RT-PCR product from each 
time point was subjected to 2% agarose gel electrophoresis and stained with ethidium bromide 
Marker^X-l-74-RF-DNA^^ (C)_ -Southern blot_analysis of 

glomerular RT-PCR product using rat IL-13 cDNA as . a chemiluimnescent probe. 
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from the in vitro actions of IL-4 and IL-10 (15), this T H 2 response was associated with decreased 
IL-1 and T H l-lymphokine (IFNy and IL-2) mRNA production suggesting that T H 2 lymphokines 
may modulate glomerular inflammation. Whether IL-13 plays a role in regulating macrophage 
function in vivo remains to be determined. The availability of rat IL-13 cDNA should allow the 
generation of recombinant rat IL-13 and of neutralizing antibodies necessary for further analysis 
of the role of IL-13 in experimental glomerulonephritis in the rat. 
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SUMMARY A cDNA of the complete coding region of rat IL-10 was cloned and sequenced 
using RNA isolated from a cultured population of thoracic duct T-lymphocytes obtained from 
Trichinella spiralis infected animals. The OX8" OX22"T-helper cells were stimulated in vitro 
with Concanavalin A for 24 hours prior to harvest. Reverse transcription of cellular RNA was 
primed with oligo-dT followed by amplification of IL-10 specific cDNA by polymerase chain 
reaction with synthetic oligo nucleotide primers chosen from two highly conserved regions of 
mouse and human EL- 10. The sequence of the coding region of the amplified, cloned rat IL-10 
cDNA is .90% identical to the mouse and 82% identical to the human IL-10 cDNA coding 

regions. C 1992 Academic Press, Inc. 



Mouse interleukin 10 (mIL-10) was originally named cytokine synthesis inhibitory 
factor (CSIF) due to the inhibitory activity of this molecule on the synthesis of interferon gamma 
(IFN-y) by a mouse T cell clone (1). Analysis of the cDNA and protein sequences of mIL-10 
indicated striking homology with a partially characterized product, BCRFI, encoded by the 
Epstein-Barr virus genome (2). A human cDNA of DL-10 (hDL-10) was characterized and shown 
to be nearly 73% identical in deduced amino acid sequence with the deduced amino acid 
sequence of mIL-10. Both hIL-10 and BCRFI have been shown to inhibit IFN-y production (3). 
Murine IL-10 has been shown to stimulate proliferation of CD4" CD8" thymocytes (4). Human 
monocytes, in response to LPS stimulation, produce IL-10 which may function in an- 
autoregulatory role by down regulating class II MHC molecules and inhibiting the production of 
proinflammatory cytokines (5). During parasitic infections the establishment of resistance or 



*Sequence data- from this article have been deposited with the EMBL/GenBank Data 
Libraries under Accession No. L02926. 
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Abbreviations: BCRFI, Epstein-Barr virus open reading frame I; CSBF, cytokine synthesis 
inhibitory factor; Con A, concanavalin A; LL-, interleukin-; RT-PCR, reverse 
transcriptase-polymerase chain reaction. 
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acute disease appears to be at least partly determined by the dominance of IL-10 or EFN-y, 
depending on the specific parasite (6,7). 

Our preliminary studies examining the cytokine profile of a protective T helper cell 
population (OX8~ OX22") isolated from the thoracic duct of rats infected with the nematode 
Trichinella spiralis (8), indicated that these cells exhibit some characteristics of Th2 phenotype 
(9). Stimulation of the isolated cell population in vitro with Con A or T. spiralis specific 
antigen, led to the secretion of factors that caused isotype switching of B cells from IgM to IgE, 
an IL-4 response, and the differentiation of eosinophils from bone marrow cells, an IL-5 
response (unpublished, K. Ramaswamy and R. Bell). Analysis of RNA from these cells by 
reverse transcriptase-polymerase chain reaction (RT-PCR) using primers for EL-2, IL-4, IL-5 
and IFN-7 indicated that mRNA of all four cytokines were present in the population 
(unpublished, R.Goodman). Identical analyses of the non-protective T cell population (0X8" 
OX22 ) which was isolated concurrently from the same animals and stimulated identically, 
produced a similar mRNA profile but those cells were unable to elicit eosinophil differention 
even though IL-5 mRNA was present. Both cell populations contain significant levels of IFN-y 
mRNA. To determine whether the differences between these cell populations where in pan due 
to differences in IFN-y or IL-10 production, we have developed a RT-PCR assay for rat IL-10 
using an oligonucleotide primer identical to the translation initiation coding region of mIL-10, 
having three mismatches with- human IL-10, and an antisense oligonucleotide for the 
3 '-untranslated region which is complementary to mIL-10, but differs in the distal half from 
hIL-10. To verify the identity of the product we have cloned and sequenced the resulting 
cDNA and report our findings here. 

MATERIALS AND METHODS 

Cellular RNA was isolated from pulverized frozen rat tissues and from separated 
Con A stimulated OX8" OX22" thoracic duct T lymphocytes (8) of T. spiralis infected AO rats, 
by the guanidine/acidic phenol rchloroform method of Xie and Rothblum (10). RNA concentra- 
tions were determined by measuring UV absorbance at 260 nm. The integrity of the RNA was 
checked by visualizing the ethidium bromide stained rRNA bands in the formaldehyde denatur- 
ing gel used for the Northern blot which was performed as described earlier (11). 

Oligonucleotides for IL-10 were selected by comparing mouse (1,13) and human 
(3) sequences to identify highly conserved regions. Specific sequences of the oligos are identi- 
cal to those of mouse IL-10 at the translation initiation site in exon 1 (oligo#l, 5'- 
dfcatgcctggctcagcactgc], sense strand), in the 3'-untranslated region (oligo#2, 5'- 
d[gggaactgaggtatcagagg], antisense strand), and in mid-exon 1 (oligo#3, 5*- 
d[gactgggaagtgggtgcag], antisense strand). Oligo#l and oligo#2 were used as primers for poly- 
merase chain reaction (PCR) amplification. 01igo#3 was labeled and used to probe Southern 
blots. 

RT-PCR reactions were performed to clone rat IL-10 and to detect small quantities 
of IL-10 mRNA using cellular RNA, oligo#l and oligo#2 (above) and a GeneAmp RNA 
PCR Kit (Perkin-Elmer) following the manufacturers directions. To clone the rat IL-10 cDNA, 
total RNA was used from OX8" OX22" cells (above) which were stimulated in vitro with Con 
A for 24 hours before collection. First strand synthesis of 0.5 jig of RNA was performed at 
42°C using Oligo-dT as the primer. PCR was accomplished by the addition of AmpliTaq DNA 
Polymerase and oligonucleotide primers #1 and #2 (to a 250 nM final concentration of each), 
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followed by 35 cycles of amplification (94°C, 45 s; 50°C, 45 s; and 72°C, 80 s) and a final 
primer extension at 72°C for 5 minutes. Comparative IL-10 RT-PCR assays using cellular 
RNA from various tissues were performed in an identical fashion except the PCR was stopped 
after 25 cycles and identical samples of 0.03 ug of RNA were also analyzed. Southern blotting 
for detection of the amplified EL- 10 cDNA products was accomplished electrophoresis of sam- 
ples in a 3% NuSieve 3:1 agarose gel (FMC) with 1 X Tris-acetate-EDTA buffer (15), denatura- 
tion of the DNA with sodium hydroxide and neutralization with Tris base (pH 8) followed by 
capillary transfer to GeneScreen Plus membranes (DuPont-New England Nuclear) with 10 X 
SSC (12). Blots were prehybridized and hybridized in the same solution (20 mM Tris, pH 8; 6 X 
SSPE ; 1 mM EDTA; 5 X Denhardt's solution; 50 /xg/ml sheared, denatured Salmon sperm 
DNA; and 0.5% SDS) at 50°C. 01igo#3 was labeled with [ 32 P]cordycepin-5 , -triphosphate 
(DuPont-NEN) using Terminal deoxynucleotide transferase (Promega) as described by Tu and 
Gohen (14) for use as a probe. Hybridization was at 50°C overnight. Unbound probe was 
washed off with two incubations each in 6 X SSPE, 0.1% SDS at room temperature and at 50°C 
Bands were detected by exposing X-OMAT AR film (Kodak) with washed membranes. Band 
sizes were determined by comparing migration distances with bands from a 123 bp sizing ladder 
(Gibco-BRL) electrophoresed beside RT-PCR samples and detected by ethidium bromide stain- 
ing. 

To clone the IL-10 cDNA, a sample from RT-PCR was electrophoresed through 
1.0% SeaKem GTG agarose (FMC), with 1 X TAE, then excized from the gel after ethidium 
bromide staining, and purified with Geneclean (BIO 101). The cDNA, approximately 680 bp 
long, was ligated with pT7Blue(U) plasmid DNA (Novagen, Inc.) using T4 DNA Ligase (New 
England Biolabs). NovaBlue competent Escherichia coli were transformed with ligation prod- 
ucts and recombinant colonies were selected on Ampicillin/Tetracycline plates containing color 
substrates for the Lac Z gene product. White colonies were screened by hybridization with 32 P- 
labeled oligo#3 to select IL-10 bearing plasmids which were amplified and purified from "mini- 
preps" (15). 

Plasmids were denatured with sodium hydroxide, and sequenced with Sequenase 2.0 
(U.S. Biochemicals) using vector specific primers T7 S'-d(taatacgactactataggg) and Ml 3 
Forward-40 5*-d(gttttcccagtcacgac), with separation by electrophoresis through a 5% HydroLink 
Long Ranger gel (J.T.Baker) and as described earlier (11). Sequencing reaction products 
were detected by autoradiography and were compared with mouse and human IL-10 sequences 
using the ALIGN program (Scientific and Educational Software, State Line, PA). Protein struc- 
ture calculations for the deduced mature protein were accomplished using the IBI Pustell pro- 
gram (Kodak, New Haven, CT). 

A Northern blot of cellular RNA from rat brain and Con A stimulated T cells was 
performed as described earlier (11). The cloned rat IL-10 cDNA insert was labeled with 
[ 32 P]dATP by random primed cDNA synthesis using a Prime-a-Gene kit (Promega) for use as 
a probe: The IL-10 mRNA size was determined by comparing the migration distance of the de- 
tected band with the distances of 28S and 18S rRNA bands detected by ethidium bromide stain- 
ing of the original gel. 



RESULTS AND DISCUSSION 
Reverse transcription and PCR amplification of RNA from Con A stimulated OX8" 
OX22" T cells from T. spiralis infected rats, using the mouse IL-10 primers, produced a 682 bp 
cDNA which hybridized specifically to the internal oligo#3 of mouse IL-10. Ligation of this 
fragment into a pT7Blue plasmid followed by transformation into E. coli produced five clones 
with approximately 680 bp inserts which were partially sequenced to determine the orientation of 
the clones within the plasmids. Two clones (pIL10-30 and pIL10-39) which had been inserted in 
the o ppo site orie ntation relative to the multiple cloning site were sequenced completely using 
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Figure 1- Diagram of the cDNA clone, cDNA sequencing strategy, restriction enzyme sites and 
deduced protein coding region for rat IL-10. The full sequence of 682 bases includes a single 
open reading frame (hatched) with a translation initiation codon at base 2 and a termination 
codon at base 538. The first and last 20 bases of the cDNA represent mouse EL- 10. There are 
two Pst I (P) and one Eco RI (E) restriction sites in the cDNA. The positions of the three 
oligonucleotides are indicated. Sequence reactions were in the direction indicated by the arrows 
and represent reactions from pIL10-30 (o), which is oriented with the 5'-end at the T7 end of the 
multiple cloning site, and pIL 10-39 (*), in which the cDNA is oriented oppositely, with the 
5'-end near the universal-40 primer site. 



replicate sequencing reactions of each strand, with multiple gel loadings to produce a continuous 
sequence as indicated in Figure 1. The sequence determined for these two clones contained a 
single open reading frame of 537 bases beginning at base 2, followed by 144 bases of the 
3'-untxanslated region (rIL-10 in Fig. 2). Other than orientation, the only difference between the 
two clones is that pIL10-39 is missing the "T" overhang that should exist at the ligation site 
adjacent to the 5' -end of the cDNA. 

Comparison of the resulting open reading frame of this sequence with the coding region 
of mouse and human IL-10 sequences (1,3) indicated that it is 90% identical to the mouse and 
82% identical to human coding region sequences (Fig. 2). However, the first and last 20 
nucleotides of our cDNA actually represent mouse IL-10 and could differ slightly from the actual 
rat sequence. Comparison of the three species indicates unusually strict conservation of the 3' 
untranslated region in the IL-10 gene. As suggested by Moore et al. (1), there are a number of 
"AT" rich regions in the 3'-end of mIL-10 which may be involved in regulating mRNA stability 
and the most proximal of these is conserved in both sequence and position in rat IL-10 (Fig. 2, 
bases 654-658). In addition, significant regions of homology exist between the S'-untranslated 
regions of mIL-10 and ML- 10 beyond the extent of the rIL-10 cDNA (not shown), which include 
the "AUUUA" motif involved in regulated degradation of some mRNA's (16), suggesting that 
there is post-transcriptional regulation of IL-10 mRNA. 

The deduced amino acid sequence (Fig. 3) should be full length at 178 amino acids, based 
on the mouse and human sequences. The probable signal peptide cleavage site was predicted 
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base 

Met * * * * * * * * * 

rIL-10 CATGCCTGGCTCAGCACTGCTATGTTGCCTGCTCTTACTGGCTGGAGTGAAGACCAGCAAAGGCCATTCCATCCGGGGTGACAATAACTG 90 

mIL-10 C A....CA...G..T GG G.A..G AA 

hIL-10 ACA C G..C.C...A-...G....G.G CC GGG . .C..A.TC...G..C.G... 

//////oligoffl/////// — - /// 

* ****** ** 

rIL- 10 CACCCACTTCCCAGTCAGCCAGACCCACATGCTCCGAGAGCTGAGGGCTGCCTTCAGTCAAGTGAAGACrTTCTTTCAAAAGAAGGACCA 180 

tnIL-10 G G T C. .A C..G CA 

hIL-10 G..A..T.C.TA T T. .CCA. A CAG T T- . 

///oligo#3////// 

* *.* * * * * * * 

rIL-10 GCTGGACAACATACTGCTGACAGATTCCTTACTGCAGGACTTTAAGGGTTACTTGGGTTGCCAAGCCTTGTCAGAAATGATCAAGTrTTA 270 

mIL-10 A..C..C A A. .G C 

hIL-10 T.GT. .T.A.AG. .G G...G C T..G C 

********* 

rIL-1 0 CCTGGrAGAAGTGATGCCCCAGGCAGAGAACCATGGCCCAGAAATCAAGGAGCATTTGAATTCCCTGGGAGAGAAGCTGAAGACCCrCTG 360 

mIL-10 G T A. 

hIL-10 AG..G A..T A. A C C....G....C G C A. 

* ****** ** 

rIL-10 GATACAGCTGCGACGCrGTCATCGATTTCTCCCCTGTGAGAATAAAAGCAAGGCAGTGGAGCAGGTGAAGAATGArTTTAATAAGCTCCA 450 

mIL-10 . . .G.G. . . . A. G AAAA A G G GG. 

hIL-10 . C . GAG . . .A. .G T A..C..G C CC 

* * * * * *■* * End * 
rIL-10 AGACAAAGGTGTCTACAAGGCGATGAATGAGTTTGACATCTTCATOUVCTGGATAGAAGCCTAC 540 

mIL-10 . . . .C A A. . .A. . . TGA GC.A. .A 

hIL-10 . . .G CA A G A A A . G . . G . . ACG . . .C. . . GA 



rIL-10 CACCCG GCA T C T AC TGGAC TGCAGG ACAT AAAT - AG AG — C T - TC T AAATC TG ATC C AG AG A T C T TAGC T AAC 

mIL-1 . . . . T . CAGTGTGTATT .AG . . . G C C..G.C-....--..C G 

hIL-10 . . T.A. . GTGGCGAC . . . .-.A. . . .CT T . . . . GT . . -C . A G . . . . TG . G . C . . . GG . A . . G . 
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* * * * * AAA^A* * * 

GGGAGCAACTCCTTGGAAAACCTCGTTTGTACCTCTCTCCAA-AATATTTATTACCTCTGATACCTCAGTTCCC 
. .A. A G. - 



rIL-10 
mIL-10 
hIL-10 T.ACC..G.C 



AG -TA, 



TAT . G . 



//////oligo#2/////// 



Figure 2. Nucleotide sequence of rat IL-10 cDNA and comparison with mouse and human IL-10 
cDNAs. The complete nucleotide sequence of the rat IL-10 cDNA is shown, with the 
similarities (.) and differences (base code) and gaps (-) in alignment of the mouse and human 
cDNA sequences are indicated for the corresponding region of the cDNAs. The start codon 
(Met) and termination (End) are shown. Oligonucleotide positions (hatched) are indicated for 
the PCR primers (#1 and #2) and for the label used in Southern blots (#3)* A putative 
destabiluation signal is marked ( MAAA ). 
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ccc 



* * ■* * * * * ^ ^ 

MPGSALLCCLLLIAGVKTSKGHSIRGDNNCTHFPVSQTHMI^ q 0 

T.MRI.R.QYS.E G.S...L...T T M o 

V . . T . .RA.P.QGTQSE.S GNLPN . . . D . . D . . .R M L . . KE . . . E .......... .Q.. 



HS 



ccc 



. * * * * * * * * 

rILlOptn LVEVMPQAENHGPEIKEHLNSLGEKI^TLWIQLRRCHRFIJ^ 178 

^]"?-0P tn K RMR K S E.Q MMI. S 

nillOptn -E QD.D. .A.V N RLR A E. .1 S Y M.M.IR. 

Figure 3. Homology alignment of the deduced amino acid sequences of rat, mouse and human 
IL-10 peptides. The full-length deduced amino acid sequence of rat IL-10 is shown with the 
similarities in the mouse and human proteins indicated (.) as well as the differences. The most 
likely signal sequence clevage site is indicated (V) after amino acid 18, as are the potential 
N-linked glycosylation sites of mouse and rat IL-10 (ccc). 
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based on frequency calculations (17) to occur after amino acid 18, leaving a mature size of 160 
amino acids. The predicted molecular weight for the mature protein is 18.6 kDa however, there 
are two potential N-linked glycosylation sites (Fig. 3) which, if utilized in the native protein, 
would alter the molecular weight. The first potential N-glycosylation site is present in the rat 
and mouse but not human deduced protein sequences, whereas the second site is present in all 
three. The calculated pi of the mature rat protein is 9.1 , and the calculated pi values for mouse is 
8.7 and the human is 8.2. There are four conserved cysteine residues in the mature IL-10 peptide 
from all three species, and a fifth in rat and mouse, however the native proteins will have to be 
analyzed to determine if there are any internal or external disulfide bridges. Comparison of the 
deduced amino acid sequences of the three IL-lOs indicates that mouse and rat share 83% 
identity while human and rat have 73% identity (Fig. 3). Mouse and human IL-10 peptide 
sequences match at 72% of the amino acid residues. 

The intact mRNA of rat EL-10 is approximately 1700 bases as determined by the position 
of a single detectable hybridization band relative to the positions of 28S and 18S RNA bands in a 
Northern blot of cellular RNA from Con A stimulated OX8" OX22" T cells (20 |_tg RNA). This 
size is similar to that of hDL-10 mRNA (3). IL-10 mRNA was not detected by Northern blot 
analysis of 20 mg total RNA samples from brain, Peyer's patch, upper small intestine and 
freshly isolated thoracic duct T cells from T. spiralis infected rats (not shown). 

RT-PCR results analyzing RNA from freshly separated, Con A stimulated and T. 
spiralis antigen stimulated populations of thoracic duct CD8 + and CD8" T cells indicate that 
IL-10 mRNA is expressed in the Con A stimulated cells in far greater abundance than the other 
populations (not shown). Whether the protein is expressed in a similar pattern will be tested 
when a suitable mAb is prepared. 
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Preface 



Application of recombinant DNA techniques to biology- 
is bringing about a revolution in our understanding of 
living organisms. There is no field of experimental bi- 
ology that is untouched by the power we now have to 
isolate, analyze, and manipulate genes. When the first 
edition of Recombinant DNA was published in 1983, re- 
combinant DNA techniques were already being used 
extensively For the analysis of viral and bacterial genetics, 
but dissection, of eukaryotic genes was only just begin- 
ning. There were hints of what was to come. The concept 
of the gene as a continuous stretch of DNA had been 
shattered with the discovery of introns, but alternative 
splicing and genes- wi thin-genes were yet to be revealed. 
Identification of cellular oncogenes seemed to promise 
an understanding of cancer, but the mechanisms of their 
action — and the existence of tumor suppressor genes — - 
were still subjects for speculation. A handful of genetic 
diseases were being analyzed at the molecular level, but 
the isolation of the disease genes and the development 
of gene therapy were yet to come. 

Our. aim in writing the second edition of Recombinant 
DNA is to show how recombinant DNA techniques have 
led to the explosion in pur knowledge of fundamental 
biological processes. As in the first edition, which was 
subtitled A Short Course, we provide a concise presentation 
of the methods, underlying concepts, arid far-reaching 
applications of recombinant DNA technology. The field 
has grown since the publication of the first edition, and 
so has our book. But even though our previous subtitle 
may be inappropriate for this enlarged edition, our ap- 
proach to the material has remained true to the spirit of 
the "short course"; as before, the uninitiated will find 
access to the field of recombinant DNA here. 

The book is now divided into six major sections. The 
first five chapters, which are largely unchanged from 
the first edition, provide a historical introduction to the 



early development of recombinant DNA technology, up 
^ to the point when studies of eukaryotic organisms began 
in earnest. In the next section we describe in detail the 
methods currently used to clone and analyze genes, and 
devote an entire chapter to the polymerase chain re- 
action, which has had an extraordinary impact on re- 
search. The great power of recombinant DNA techniques 
comes from the ability to explore gene functions by 
manipulating genes and then introducing them back into 
ceils. The third section of the book discusses how this 
is done in mammalian cells, yeast, mice, and plants. The 
fourth section describes the progress these manipulations 
have allowed in key areas of biology. Here the range of 
recombinant DNA applications is demonstrated, from 
the analysis of cell cycle control and embryonic devel- 
opment, to the isolation of genes involved with brain 
function. Indeed, these techniques have spawned a whole 
industry — biotechnology. In the fifth section, we describe 
some of its accomplishments, including the development 
of genetically engineered pharmaceutical and agricul- 
tural products, and the studies of the human immuno- 
deficiency virus that are leading the attack on AIDS. The 
differences between the first and second editions are 
perhaps most evident in the final section, where we de- 
scribe the revolution in human molecular genetics* and 
the ways in which recombinant DNA techniques are 
providing new methods for diagnosis and treatment of 
human inherited diseases. 

The topics that are covered and the approach we take 
to describing them make this book suitable for under- 
graduate and graduate students in molecular biology, 
cell biology, biochemistry, genetics, or biotechnology 
courses; for medical students and physicians; and for 
others who have an interest in recombinant DNA tech- 
niques — for example, forensic scientists, patent attor- 
neys, and science journalists. 
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without describing the experiments that were done to 
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CHAPTER 




In Vitro 
Mutagenesis 




ecombinant DNA technology and DNA sequencing provided the tools 
%_.to clone and characterize genes. As we learned in Chapter 8, simple 
inspection of gene sequences told us. much about genomic organization. Func- 
tional sequences, such as transcriptional control elements, could often be iden- 
tified by comparing sequences of a number of genes. However, to delve deeply 
into the structure and function of genes required the ability to change the DNA 
sequence and examine the effect of the change on gene function. For decades 
before the advent of recombinant DNA, this was done by classical genetics, the 
identification of mutant organisms with new properties. From the generic prop- 
erties of mutants, information about the structure and function of the underlying 
genes could often be inferred. This approach, however, was limited to organisms 
in which simple genetic analysis was possible — bacteria, yeast, fruit flies. Genetic 
analysis of more complex, longer-lived organisms like mice and men was slow 
and difficult. 

Recombinant DNA changed all thaL The ability to isolate genes as molecular 
clones, the development of tools to modify gene sequences in the test tube, and 
the power to return altered genes to the organism to test their function have 
revolutionized the way genetics is done in higher organisms. Because we now 
often work "backwards" from gene sequence to gene function, in contrast to 
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FIGURE 11-1 

General strategy for an in vitro mutagenesis experiment 
Most procedures for in vitro mutagenesis follow the same 
basic scheme: Plasmid DNA is "mutagenized" in vitro then 
introduced into £ coli by transformation. Depending on the 
method, mutant clones can be isolated and tested individ- 
ually or a library of mutant piasmids can .be obtained 
which are tested using a generic screen. 



classical genetics, -this new approach spawned by re- 
combinant DNA is called reverse genetics. In this chapter 
we- will learn ways to alter the sequence of a cloned 
gene at will and how these methods are used to un- 
derstand the structure and function of genes and gene 
products. 

In -Vitro Mutagenesis. Is Used 
to Study Gene Function 

In vitro mutagenesis of cloned genes has become a 
standard tool in the functional analysis of nucleic acids 
and proteins. Most procedures follow the same basic 
scheme (Figure 11-1). Plasmid DMA containing the 
gene of interest is treated in vitro by some mutagenesis 
procedure that alters the DNA either chemically or 
enzymatically. The mutagenized plasmid DNA is in- 
troduced into £ coli by transformation, and colonies 
containing plasmid molecules are selected by anti- 
biotic resistance. Mutants can be made one at a time, 
or hundreds of different mutants can be created in a 
single mutagenesis experiment. Mutant piasmids can 
be isolated from single colonies and tested individ- 
ually. Alternatively, plasmid DNA can be prepared 
from pooled colonies and the resulting library tested 
en masse to identify mutant piasmids. 

The various approaches to mutagenesis can be 
grouped broadly into random and site-directed meth- 
ods. Random methods put mutations anywhere in a 
plasmid. They are best used to identify the location ■ 
and boundaries of a particular function within a cloned 
DNA fragment and are most readily used for this 
purpose when a simple genetic screen (or selection) 
is available. A genetic screen or selection consists of 
a system to test the function of the DNA of interest 
in cells without having to isolate each plasmid indiv- 
idually. Random mutagenesis is often used as a first 
step, when little is known about the function encoded 
by particular DNA fragment. Analysis of random mu- 
tants generally provides only a simple identification 
of the functional region but does not explain how 
things work on a molecular leveL The value of such 
a strategy is that it quickly helps to narrow down the 
focus of attention from a large DNA fragment to a 
smaller region that can be studied subsequently in 
greater detail. As we will learn, random mutagenesis 
can be accomplished by several different methods, 
such as altering the sequences within restriction en- 
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donuclease sices, inserting an oligonucleotide linker' 
randomly into a plasmid, damaging plasmid DNA in 
vitro with chemicals, or incorporating incorrect nu- 
cleotides during in vitro DNA synthesis. 

Once an important functional domain in a gene has 
been identified by random mutagenesis, site-directed 
methods — putting mutations precisely where they are 
needed — are used to define the role of specific se- 
quences. In addition, directed mutagenesis provides a 
powerful tool for the analysis of protein function, by 
allowing researchers to make specific and subtle 
changes in the structure of the protein. A number of 
strategies have been developed to construct site-di- 
rected mutants in vitro, but this type of mutagenesis 
is, best accomplished using s ynthetic oligonucleotides. 
With an oligonucleotide the desired sequence is sim- 
ply built into the wild-type framework. Nowadays, 
oligonucleocT3e-directed mutagenesis reactions are 
relatively straightforward, and oligonucleotides are 
cheap and easy to obtain! The limitation of site-di- 
rected mutagenesis is that you must already have 
enough information to know what you wish to change. 
There are two standard ways of using oligonucleo- 
tides to construct site-directed mutants: mutagenesis 
by gene synthesis and mutagenesis by enzymatic ex- 
tension of a mutagenic oligonucleotide. By using de- 
generate oligonucleotides (see Chapter 7) a set of 
"random" mutations at a specific site can also be made. 

Restriction Endomiclease Sites Provide 
the Simplest Access for Mutagenesis 

One of the first experiments done with a cloned DNA 
fragment is to map the positions of restriction endo- 
miclease cleavage sites in the DNA by using a battery 
of different enzymes. Although this information could 
be precisely obtained from the DNA sequence, map- 
ping restriction sites can be accomplished rapidly and 
is often done in conjunction with sequencing. Re- 
striction endonuclease recognition sites provide the 
simplest way to modify a DNA clone in vitro (Figure 
11-2). Cleaving plasmid DNA with a restriction en- 
zyme that recognizes only one site produces a linear 
molecule. This serves as an entry point for modifying 
the DNA sequence in the vicinity of the restriction 
site. For example, the enzyme EcoKl recognizes the 
sequence GAATTC and produces ends with 5' ov- 
erhangs. The ends can be made even (blunt) by treating 
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FIGURE 1L-2 

Creating a mutation by manipulation of a restriction site. 
Plasmid DNA is cleaved with EcoKl restriction endonu- 
clease, which generates a linear fragment with 5' ends that 
have four unpaired nucleotides (so-called sticky ends).- 
Treacment with SI nuclease (left) removes these nucleo- 
tides, and the linear fragment is then treated with DNA li- 
gase. The resulting circular molecule contains a deletion of 
4 bp. Alternatively, addition of DNA polymerase and deoxy- 
ribonucleotide triphosphates (dNTPs) to the plasmid cleaved 
by EcoR.1 extends the 3' ends by. DNA synthesis (right). 
After ligation, the resulting molecule contains an insertion 
of 4 bp. In both cases, the £coRI site has been destroyed. 
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FIGURE 11-3 

Linker insertion mutagenesis to map functional domains of a 
bacterial transposable element. The starting plasmid contains 
an intact transposon, an ampicillin-resistance gene for selec- 
tion.in £ coli, and sequences for plasmid replication. The 
UNA is treated with a low concentration of deoxyribonu- 
clease I in the presence ofMn J+ . Under these conditions 
the enzyme makes double.stranded cuts at random positions 
in the plasmid, generating a collection of linear DNA mole- 
cules broken at different positions. Oligonucleotide linkers 
encoding an £: 0 RI restriction site are added to the ends 
with DNA I.gase, the linear molecules are treated with 
EcoRI endonuclease to create sticky ends on the linkers, and 
the molecules are ^circularized. The circular molecules are 
transformed into £ coli, and ampicillin-resistant colonies are 
selected. Plasmid DNA is isolated from individual colonies 
introduced into another strain of £ col,, and tested for activ- 
ity of the transposon. The positions of the inserted linkers 
are mapped by restriction digestion. Linkers inserted in one 
region (blue) of the plasmid inactivated the transposon No 
linker insertions in the ampicillin-resistance gene were re- 
covered, because these plasmids would fail to yield a drutr- 
resistant colony in the original selection of transformed 
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the cleaved DNA with DNA polymerase in the pres- 
ence deoxyribormcleotide triphosphates. The two 
blunt ends can then be linked together again (ligatfed) 
by incubating the linear plasmid molecule with DNA 
hgase. A few nanograms of DNA from the in vitro 
ligation reaction is used to transform E coli, and the 
new modified plasmid is isolated from one of the re- 
sulting colonies. The net result of these manipulations 
is to insert 4 bp into the plasmid at the EcoRI site 
Alternatively, a small deletion mutation can be made 
by treating the linearized DNA with Si nuclease, 
which specifically digests single-stranded DNA This' 
creates blunt ends by removal of the four nucleotides 
that constitute the 5' overhang generated by EcoRI at 
each end. Subsequent ligation of the DNA into a co- 
valently closed circular molecule thus results in the 
deletion of 4 bp from the DNA. In each example, the 
new sequence no longer encodes the EcoRI recognition 
site. These types of manipulations, if done to a protein- 
coding sequence, would change the translation^ read- ' 
mg frame, resulting in production of a grossly altered 
protein. The major limitation of using restriction sites 
to make mutations is that there simply may not be 
sites in regions of the gene the experimenter wishes 
to alter. 
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LinEer Insertion Is Used to Map a 
Bacterial Transposon 

We have learned that it is a simple matter to cleave 
a plasmid with a restriction enzyme, blunt the ends 
by treatment with a DNA polymerase, and rejoin them 
by ligation. A variation on this technique is to rejoin 
the ends in the presence of a synthetic oligonucleotide 
"linker," often one that encodes a restriction site. In- 
sertion of the. linker disrupts the gene sequence; the 
position of the inserted linker can be easily mapped 
by cleavage of the plasmid with the restriction enzyme 
that cuts the linker. 

A similar method was used to define the functional 
regions of a bacterial transposable element (a "jumping 
gene," see Chapter 10), by inserting linkers at many 
alternative positions throughout the element. To place 
linker insertions in the transposon, a plasmid carrying 
a clone of the transposon was treated with a nuclease 
that cleaved the plasmids at random positions (Figure 
1 1-3). Cleavage conditions were adjusted so that each 
plasmid was cut just once on average. The linearized 
molecules were isolated and ligated into circles again 
in the presence of an 8-bp linker oligonucleotide con- 
taining an EcoR.1 restriction site, resulting in insertion 
of the linkers into random sites, one in each plasmid. 
The resulting plasmids were transformed into E. colt 
and, using a genetic screen, examined to see if the 
transposon could jump. Insertion of a linker into a 
region of the transposon critical for its function in- 
activates it, presumably by putting a protein-coding 
sequence out of frame. By mapping the positions of 
the inserted linkers by restriction analysis, the loca- 
tions of functional regions of the transposon were 
deduced. 



Construction of Nested Deletions Maps 
the Boundaries of a Transcriptional 
Control Region 

Transcription of the gene encoding the 5S ribosomal 
RNA molecule is carried out by RNA polymerase III 
(pol III, see Chapter. 8). To identify the sequences 
within the 5S gene required for transcription by pol 
III, a series of deletion mutations was made and tested 



for their abiliry co support accurate transcription. Two 
sets of deletions were made. One was made by cutting 
a plasmid carrying a cloned 5S gene at a restriction 
site on the 5' side of the gene. The linearized plasmid 
was treated with a combination of nucleases that di- 
gested away DNA from the ends of the molecule 
(Figure 11-4). The amount of DNA removed was 
controlled by varying the time, temperature, or en- 
zyme concentration in the reaction. A second set of 
deletions was generated from plasmid DNA cleaved 
at a site on the 3' side of the gene. The result was 
two sets of plasmids with progressively larger deletions 
toward the gene from both directions. Testing these 
genes revealed that only deletions entering a 35-bp. 
region within the transcribed region of the 5S gene 
abolished transcription by pol III. Therefore, this dele- 
tion analysis mapped the transcriptional regulatory 
element to this 35-bp stretch, which has subsequently 
been analyzed in much greater detail by site-directed 
mutagenesis. 

Several different types of enzymes can be used to 
produce deletions. Generally, these enzymes delete 
DNA from both ends of a linearized plasmid molecule. 
Often, however, one end of the molecule contains 
sequences that need to be retained in the plasmid 
because, for example, they are required for plasmid 
replication. In the 5S gene deletion experiment, this 
limitation was accommodated by isolating the deleted 
gene fragments and recloning them into a new vector. 
Alternatively, a strategy can be used that limits dele- 
tion- to one end of a linearized plasmid molecule (Fig- 
ure 11-5). This method is widely used to generate 
nested deletions for DNA sequencing (see Chapter 7). 



Linker-Scanning Mutagenesis Permits 
Systematic Analysis of Promoters 

Deletion mutagenesis of the 5S gene mapped the 
boundaries of the transcriptional control region in the 
gene. But not all the nucleotides within the boundaries 
of that 35-bp region are necessarily critical for func- 
tion. Therefore:, methods were needed to change in- 
dividual nucleotides in a target without generating 
cross deletions or other rearrangements. This was ac- 
complished for a viral promoter using an elegant ad- 
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FIGURE 11-4 

Construction of a nested set of deletion mutants to map the 
transcription control region of a 5S ribosomal RNA gene, 
(a) A plasmid clone was linearized with a restriction 'enzyme 
at a position (A) on the 5' side of .the gene. The linear frag- 
ments were treated with an exonuclease, which digests 
DNA from both ends of the molecule. Portions of the reac- 
tion were removed at different rimes to recover populations 
of molecules with progressively larger deletions. Linkers 
were added to the ends, and the molecules were cleaved 
with restriction enzymes specific for sites B and C to sepa- 
rate the 5S gene fragments from the remnants of the vector. 
The fragments were recloned into a new vector, generating 
the set of rightward deletion mutants. To create the left- 
ward deletion mutants, this process was repeated after 
cleaving the plasmid at restriction site B. (b) Individual plas- 
- mids were isolated after transformation, their deletion end- 
points determined by DNA sequencing, and their ability to 
support transcription by RNA polymerase III tested with an 
in vitro assay. As can be seen by comparing transcription 
activity with the extenr of deletion, transcription is inhibited 
when the rightward (5') deletions enter the +40 region and 
when the leftward (3') deletions pass the +80 point. This 
suggests that the transcription control region lies between 
+ 40 and +80. 
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aptation of deletion mutagenesis called linker scanning. 
Using the methods outlined in Figure 11-4, two sets 
of plasmids were constructed that contained deletions 
within the promoter. One set of deletions started from 
a site beyond the 5' end and proceeded toward the 
gene, leaving the 3' end intact; the other set started 
at a point within the gene and proceeded in the op- 
posite direction, leaving the 5' end intact. Each dele- 
don terminated with a 10-bp BamHl linker. The extent 
of the deletion in the DNA was determined for each 
plasmid by DNA sequencing. Pairs of plasmids from 
the two deletion sets with endpoints 10 bp apart were 
recombined at their BamHl sites (Figure 11-6). The 
effect was to preserve the length and organization of 
the promoter — thought to be important for promoter 
function — but to replace various 10-bp segments of 
wild- type promoter sequence with the sequence in the 
linker. Thus, this experiment created a library of pro- 
moter mutants of similar structure but with nucleotide 
substitutions clustered within 10-bp windows located 
at various sites in the promoter. This collection of 
mutants spanned the length of the promoter. The re- 
sults of this analysis were discussed in Chapter 9. At 
the time, this experiment represented the most thor- 
ough analysis of a promoter in a mammalian gene. 



FIGURE 11-5 

Construction of unidirectional deletions using exonuclease 
III. Exonuclease III attacks preferentially the 3' end of a lin- 
ear DNA molecule with 5' protruding nucleotides. There- 
fore, by cleaving a plasmid molecule at adjacent sites with 
BamHl, which leaves a 5 r overhang, and PrA, which leaves a 
3' overhang, only the end generated by BamHl is attacked 
by exonuclease IIL After exonuclease III treatment, the re- 
maining single-stranded tail (along with the overhang at the 
other end) is removed with Si nuclease, which digests only 
single-stranded DNA An oligonucleotide linker is attached, 
and the fragments are ligated to form closed circular mole- 
cules. In the experiment shown here, deletions are being 
used to map the functional domains of a cloned gene in- 
serted in an expression vector. This strategy allows dele- 
tions to be made onlyin the cloned gene, without damaging 
the promoter sequence. 



Random Nucleotide Substitutions Are 
Obtained by Chemical Modification of 
DNA or by Enzymatic Misincorporation 

While linker scanning allows the creation of nucleo- 
tide substitutions, each mutant generally contains sev T 
eral substitutions, and the positions of the mutations 
depend on the availability of appropriately placed 
deletions. Therefore, several strategies have been de- 
veloped for placing single nucleotide substitutions at 
random positions in a DNA molecule. The simplest 
methods employ chemicals that modify or damage 
DNA. Generally, plasmid DNA or DNA fragments 
are treated with chemicals, transformed into £ coli, 
and propagated as a library of mutant plasmids. Chem- 
icals most commonly used for in vitro mutagenesis 
include sodium bisulfite, which deaminates cytosine 
residues to uracil, and reagents that damage or remove 



i yd 



|; C H A PTE R ] i 



Cleave with Bg/ll 
Digest with exonuclease 
Ligate BamHl linker g 
Circularize 




TK codi ng sequence 



TK promoter 
{140 bp) 



3' deletion 
mutants 



H/ndlll 




Cleave with H/ndlll 
Digest with exonuclease 
Ligate BamHI linker ggi 
Circularize 



8amHI 




H/ndlll 



H/ndlll 



Promoter 
fragment from 
Piasmid C 




Promoter 
fragment from 
Piasmid Z 

Ugate 




BamHI 



5' deletion 
^ mutants 



Recombinant piasmid 
%/ll 



Linker-scanning 
mutant 



Wild-type 
sequence 




i m i in 




FIGURE 11-6 • 

s L ^ k oTHM anning mUCa S enesIs of the viral Promoter for the thymidine kinase (TK) gene Two 
he nro k T*™ m " de h ^™"g ^ restriction sites on the 5' and f' sides of 

o ors to mike A me *° d df ? cribed " ^ (The P"™*" 15 divided ^ 

SrseouTncedt K I" 0 ' 6 A PP™<— ly one hundred plasmids 

Tdelerion of o- ^ ^ ^ endp ° mC - Pal " of deledon figments, where the 

delet on of L aJ'T™ P^sely 10 bp downstream from- the endpoint of the 3' 

aeieuon of the other fragment, were identified. The Hirdlll-BamUl fragment of the 3' dele- 

Zml^Ctt T? 1 -^ 1 fragment ,° f 5 ' ddeti0n ™ Were joined theft 
o« sh^ hf K T d *-?r PlaSmid - ThiS ^ lds molecul - "ke the 

10 bn Sr HnL W ' ld " type i n SeqU£aCe eXC£ P r for rhe ^bsticution of the 

eLmphTIhown A 5 m P f SeqUe ? Ce ^ tWCen thC tWp ddetion end P° ints " [n che 

example shown, this results .n a cluster of eight nucleotide substitutions (arrows) 



H/ndH) 




Prepare 
jingle- stranded 
DNA 



EcoRI 



Double-stranded plasmid 



Cytosrne 




FIGURE 11-7 

Chemical mutagenesis using sodium bisulfite. Sodium bisul- 
fite reacts with cytosine bases of single-stranded DNA to 
convert them to uracil, a thymine analog that base-pairs 
with adenine. Single -stranded DNA is treated with sodium 
bisulfite to modify a small number of cytosine residues in 
each molecule. An oligonucleotide primer is annealed to the 
DNA and serves as a primer for synthesis by DNA poly- 
merase. When the polymerase encounters a uracil in the 
template strand, it incorporates an adenine into the newly 
synthesized DNA Since the vector sequences are also dam- 
aged by bisulfite treatment, it is necessary to excise the dou- 
ble-stranded DNA fragment by restriction endonuclease 
cleavage and redone, it into an undamaged vector. Follow- 
ing transformation into E. coli> a library of mutant plasmids 
can be isolated or individual plasmids can be purified and 
tested The average number of substitutions in the DNA 
fragment can be controlled by altering the conditions of bi- 
sulfite treatment. 



bases, thereby preventing normal Watson-Crick base- 
pairing (these include hydrazine and formic acid, 
which are used in Gilbert-Maxam DNA sequencing, 
Chapter 5). Most often, chemical mutagenesis is per- 
formed on single-stranded DNA and followed by in 
vitro synthesis of the complementary strand using a 
DNA polymerase (Figure 11-7). This synthesis in- 
corporates the mutation into the new strand. In DNA 
treated with bisulfite, an adenine nucleotide is incor- 
po raced opposite the uracil; after transformation into 
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R- coli, the wild-type OG base pair becomes a TV A 
pair. In DNA treated with reagents that eliminate 
bases, any nucleotide can be incorporated opposite the 
"abasic" site, which still retains its deoxyribose back- 
bone although it has lost its base. The major limitation 
of chemical mutagenesis is the specificity of the in- 
dividual reagents: bisulfite mutagenesis, for example, 
changes only cytosines. 

AU possible nucleotide substitutions can be gen- 
erated using enzymatic misincorporation. Here the 



FIGURE 11-8 

Oiigonucleotide-directed mutagenesis by enzymaric primer 
extension. A "mutagenic" oligonucleotide encoding the de- 
sired mutation embedded in wild-type flanking sequence is 
annealed to a single-stranded DNA template. The sequence 
of the oligonucleoude is complementary to the template ex- 
cept for the nucleotides that define the mutation. Generally, 
the mutagenic oligomer is designed so that the mismatched' 
nucleotides are positioned in the middle and there are at 
least 8 to 12 nucleorides on either side chat base-pair with 
the template DNA. The muragenic oligonucleotide serves as 
a primer for DNA synthesis by PNA polymerase. Once rhe 
entire template has been copied, the ends of the newly syn- 
thesized strand are covalently linked by DNA ligase. The 
heteroduplex DNA is transformed into K coli Theoretically, 
both strands can replicate, segregating into separate mutant ' 
and wild-type plasmids. In practice, however, most colonies 
contain only one or the other, because enzymes in the cell 
recognize and repair mismatched nucleorides in the hetero- 
duplex before replication. Plasmid DNA is isolated from the 
resulting colonies and is screened to identify mutants. 



strategy is to perform in vitro DNA synthesis under 
nonideal conditions — suboptimal ionic conditions, 

unbalanced concentrations of nucleotide precursors 

that encourage DNA polymerase occasionally to in- 
corporate the wrong nucleotide during synthesis. For 
example, synthesis is carried out in the presence of 
high concentrations of three of the precursors and a 
very low concentration of the fourth. At positions that 
normally call for the fourth (scarce) nucleotide, one 
of the others is sometimes incorporated instead. These 
methods also exploit DNA polymerases that lack a 
proofreading activity — a 3' to 5' exonuclease mech- 
anism that checks each base pair after incorporation 
and removes nucleotides that are mismatched. Thermus 
aquaticus (Taq) DNA polymerase, used in the poly- 
merase chain reaction (Chapter 6), lacks such an ac- 
tivity. Though this is a problem when accuracy of 
synthesis is required, the PCR is a very simple and 
efficient way to introduce random nucleotide substi- 
tutions into a DNA fragmenL 

A general problem with random mutagenesis ap- 
proaches is that they often produce mutants with more 
than one substitution. Multiple substitutions in a single 
mutant complicate the interpretation of an experi- 
ment, because it isn't clear which substitution (or* 
which combination of substitutions) is responsible for 
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observed changes in die properties of the mutant. Ex- 
traordinary methods have been used to circumvent 
this problem — essentially, significantly reducing the 
extent of mutagenesis and using enrichment protocols 
to find rare mutants — but almost all these procedures 
have been supplanted by new methods that use syn- 
thetic oligonucleotides. 



Synthetic Oligonucleotides 
Facilitate Mutagenesis 

Most of the methods for mutagenesis we have dis- 
cussed so far have some significant shortcoming — they 
rely on fortuitous zccess. to a sequence via a restriction 
site, forced entry through deletion strategies, or te- 
dious screens to find randomly generated mutations 
in the region of interest. To be most powerful, mu- 
tagenesis must allow the experimenter to place any 
modification at apposition desired in cloned UJNA. 
This has bec ome not only possible, but simple and 
cheap, with the advent of synthetic DNA oligonu- 
cleotide s. Oligonucleotides provide the means to de- 
sign a p articular ^mutation and then to place it precisely 
where you want it. ■ . "~ 

The simplest method for doing oligonucleotide- 
directed mutagenesis is by enzymatic primer extension 
(Figure .11-8). In this method, an oligonucleotide is 
designed that carries the mutation flanked by 10 to 
15 nucleoddes of wild-type sequence. This "muta- 
genic" oligonucleotide is hybridized to its comple- 
mentary sequence in single-stranded wild-type DNA 
prepared from a phage or phagemid clone, forming a 
heteroduplex with mismatched nucleotides at the site 
of the mutation. Although the oligonucleotide is not 
perfecdy complementary, it will anneal if the hybrid- 
ization conditions are hot very stringent The oligo^ 
nucleotide serves as a primer for in vitro enzymatic 
DNA synthesis by a DNA polymerase that converts 
the single-stranded DNA into double-stranded form, 
using the wild-type strand as template. In this way, 
all regions of the plasmid except the region containing 
the mutagenic oligonucleotide will be wild-type in 
sequence. Once the primer has been extended com- 
pletely around the terrfplate, the ends of the newly 
synthesized strand are ligated, forming a double- 



stranded circular DNA molecule.' This heteroduplex 
DNA — one strand has the wild-type sequence and the 
other strand has the mutant sequence — is transformed 
into R colt, where either strand can be replicated. By 
the time a colony grows up, however, it usually con- 
tains only one type of plasmid, wild-type or mutant 
The types of mutations that can be made by this ap- 
proach range from single nucleotide substitutions to 
deletions or insertions, limited only. by the size of the 
oligonucleotide needed 



Mutant Clones Can Be Identified by 
Hybridization and DNA Sequencing 

Theoretically, half the daughter molecules of a mu- 
tagenesis reaction will be wild- type and half mutant 
In practice, however,. the precentage of mutant plas- 
mids is often much lower. This is due to a variety . of 
technical factors, but the consequence is that methods 
for identifying or enriching mutant clones are vitaL 
Mutant molecules can be distinguished from wild- type 
if there is gain or loss of a restriction site. Alternatively, 
the oligonucleotide that was originally used to make 
. the mutation can be used as a hybridization probe to 
distinguish mutant from wild^type molecules (Figure 
1 1-9). The mutagenic oligonucleotide is radioactively 
labeled with 32 P-ATP and hybridized to DNA from 
bacterial colonies on nitrocellulose filters, as described 
in Chapter 7. If the temperature of the hybridization 
is raised in 5 or 10°C increments, a point can usually 
be reached at which the labeled oligonucleotide will 
hybridize only to the mutant molecules (to which it * 
is perfectly complementary) and not to the wild-type 
molecules, because the hybrid is destabilized by the 
mismatched nucleotides. Plasmid DNA is isolated 
from an R coli colony that strongly hybridizes to the 
probe. Verification that the desired mutation was made 
is accomplished by sequencing the DNA of this pu- 
tative mutant clone. This technique can identify one 
mutant clone among several hundred wild-type clones. 

Several clever methods enrich for mutant clones so 
that the tedious. task of screening by hybridization is 
not necessary. In one of these techniques, the template 
DNA is biologically marked so that it is destroyed 
after transformation into R coli and the mutant strand 
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FIGURE 11-9 

Searching for mutant plasmids using the mutagenic oligonu- 
cleotide as a probe. Colonies (or plaques) resulting from 
transformation by mutagenized plasmids (see Figure 11-8) 
are prepared for colony hybridization on nitrocellulose fil- 
ters using methods described in Chapter 1. The mutaaenic 
oligonucleotide is radioactively labeled by phosphoryfating 
its 5' end using 32 P-ATP and polynucleotide kinase. The la- 
beled oligonucleotide is hybridized to the plasmid DNA on 
the nitrocellulose niters. At low temperature, the oligonu- 
cleotide will hybridize to both mutant and wild-type DNAs. 
As the temperature is increased, the mismatched oligonu- 
cleotide hybridized to the wild-type plasmid DNA begins to 
dissociate from the wild-type clones. Eventually a tempera- 
ture is reached at which the mismatched oligomers com- 
pletely dissociate from the wild-type clones but remain 
hybridized to the mutants. Since the oligonucleotide is ra- 
dioactively labeled, the nitrocellulose filter is exposed to 
x-ray film and mutant clones are identified by the presence 
of a strong signal on the autoradiograph. Mutant plasmid 
DNA is then isolated from the corresponding colony on the 
master plate, using the replica filter as a guide. " 



is preferentially replicated (Figure 1 1-10). In a second 
method, the template strand is enzymatically de- 
stroyed before transformation. Both methods can yield 
mutants at a frequency of greater than 50 percent, so 
that plasmid DNA is simply isolated from three or 
four randomly picked colonies and analyzed by DNA 
sequencing with the expectation that a mutant will be 
found among the DNA selected. 



Oligonucleotide Cassettes Provide a 
Simple Method for Introducing 
Directed Mutations 

We learned earlier that restriction enzyme sites pro- 
vide access to a cloned DNA for mutagenesis. If two 
restriction sites are close together, the intervening 
fragment can be removed and replaced with a synthetic 
double-stranded fragment (a cassette) made from two 
complementary single-stranded oligonucleotides car- 
rying any desired sequence. Often, however, conve- 
nient restriction sites are not available; fortunately, it 
is a simple matter to create them using the oligonu- 
cleotide-directed mutagenesis procedures described in 
the previ6us sections. Once the sites are in place, any 
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FIGURE 11-10 

Enrichment for oligon'ucleotide-directed mutants by using a 
uracil-containing template. Single-stranded template DNA is 
prepared in a strain- of & coli that lacks the enzyme uracil _ 
deglycosidase (ung"), so that it contains several uracil resi- 
dues in place of thymines. (Although uracil is not usually 
incorporated into DNA, it is not actually mutagenic and it 
does form a base-pair with adenine.) The mutagenic oligo- 
nucleotide is annealed and primes the synthesis of a strand 
that extends around the template in a reaction using the 
four standard dNTPs (as in Figure 11-8). Following ligation, 
the heteroduplex DNA molecules are introduced into an 
ung + strain of £ coli Once in the cell, the wild-type (tem- 
plate) strand is attacked by uracil deglycosidase, which 
causes breaks in the DNA strand, and the DNA strand is 
degraded before it can be replicated. Since the strand con- 
taining the mutagenic oligonucleotide does not contain ura- 
cil it is not attacked and is replicated normally. When this 
procedure is used, 50 percent or more of colonies contain 
mutant plasmids. 



number of new mutants can be made by inserting 
synthetic fragments into the plasmid (Figure 11-11), 
just as different cassettes can be inserted into a tape 
player. 

This method of cassette mutagenesis was the basis 
for an elegant experiment that verified a structural 
model for DNA recognition' by phage repressors. The 
repressors of the A-like phages 434 and P22 contain 
a helix- turn -helix structure (see Chapter 9) that rec- 
ognizes the operator DNA in the phage genome. It 
was hypothesized that amino acid side chains on one 
face of an a helix in the. repressor protein make se- 
quence-specific contacts with operator DNA. To test 
this hypothesis, a helix swap was performed (Figure 
11-12). Oligonucleotides were synthesized that en- 
coded the amino acids of the helix in the 434 repressor, 
with the five positions thought to contact DNA 
changed to those found in the P22 repressor. This 
synthetic fragment was swapped for the natural frag- 
ment in the 434 gene. The resulting hybrid protein 
gained the recognition specificity of the P22 repressor, 
demonstrating that this helix indeed contacts the 
DNA. 

Cassette mutagenesis with degenerate oligonucleo- 
tides can be used to create a large collection of random 
mutations in a single experiment. This method was 
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used to study the structure of the glucocorticoid re- 
sponse element (GRE), an enhancer- sequence that 
activates a family of genes in response to certain steroid 
hormones. The element had been mapped by deletion 
mutagenesis to a 30-bp region in a glucocorticoid-' 
regulated gene. To define the sequence required for 



FIGURE 11-11 

Mutagenesis by cassette replacement Plasmid DN'A is 
cl«ve d w lt h restriction enzymes EcoRI and HMUl, which 

de.vLT^'f flaDk Ae Se< * Uence » be ™» C ed. The small 
cleaved DNA fragment coining a portion of the wild- 
type sequence .s removed, and a DNA fragment (cassette) 

S f »»«*» » lifted Lo the pEi 

This mutant DNA fragment is composed of two comple- 
mentary synchenc oligonucleotides that have EcoRI and 
and III sucky ends when annealed. Because there is no het- 
eroduplex mtermedjate— the mutant cassette is simply 
swapped for the wild-type fragment-the recombinant plas- 
nuds are alI e A mutant cassette can be composed of 

l1b«r o f e ° hg0nU f le0ti J des <«* Chapter 7), resulting in a 
library of mutant plasmids containing different sequences 



m 

GRE funcrion precisely, single point mutations 
throughout the 30-bp region were generated and 
tested , n cells for inducibility by glucocorticoid hor- 
mone. Two complementary oligonucleotides were 
synthesized that carried the 30-bp GRE, but synthes,s 
was performed under conditions in which incorrect 
nucleotides were incorporated at a low frequency (Fig- 
ure 11-13). These "doped" oligonucleotides (that is 
oligonucleotides produced by. doping; see Figure 
1 1-1 3) were annealed and inserted as a cassette into 

a promoter that lacked a GRE. Using this method 
• most smgle-nucleotide substitutions at the 30 positions 

were obtained. Such a collection of mutants would 
• have been unthinkable before oligonucleotides revo- 

Iudonized in vitro mutagenesis. 

Gene Synthesis Facilitates Production 
of Normal and Mutant Proteins 

The oligohucleotide-directed mutagenesis methods 
we have described use a single oligonucleotide or 
pair of complementary oligonucleotides to insert 
tant sequences into an otherwise natural DNA frag- 
menL Wich the ^creasing availability of longer 
Pljgonudepp des.Kis now feasible" to "assemb le an r n : 
tire, gene, fro m.synthetic units. Thj^ jUfTtfT^- 
thesizing a se t_of oligonucleotides, typically 4 0 rn fin' 
nucleotides in length, that can be annealed aridhW H 
in_yitro to assemble an entire double-strandM HXU 
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FIGURE 1 1-12 

The helix swap experiment: Amino acids in the phage 434 repressor protein believed re- 
sponsible for recognition of the 434 operator were changed by cassette mutagenesis (Figure 
11-11) of 434 DNA to the amino acids believed to perform the same function in an analo- 
gous region of phage P22 repressor protein, (a) Expression in £ colt of the 434 repressor 
protein (left), with an enlargement of the site believed to bind the 434 operator; (right) the 
corresponding section of the P22 repressor protein, (b) A cassette was synthesized resembling 
the 434 domain, but with P22-type substitutions at positions thought to be essential for re- 
cognizing P22 operator DNA. This was ligated into the digested 434 plasmid, and the re- 
combinant vector was introduced into £ coli to produce the hybrid protein, which then 
recognized P22 operator DNA but not the 434 operator. 
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FIGURE 11-13 

a C ceTu™ C2geReS1S USbg d °? ed '°^onucleorides ra gener- 
ate numerous mutants m a single experiment An oli^onu- 
c eotide cassette encoding the glucocorticoid respond 

th^ri^ T theSiZed by 3 DNA ma- 
chine Synthe SI s was done under conditions in which each 

- bottle containing a particular nucleotide precursor was 
contaminated (doped) with small amounts of the other 
three precursor, In the example above, the DNA synthes- 
>zer was instructed .to make an oligonucleotide with Se- 
quence GGTTACAAACT; Thus /When a nucleotide 

IddTlZ- 5 f l° r ~f ° f ° r «»™Pl«-*e machine 
adds an aliquot of the solution from the C bottle and a C 

chls S H UPled t0 K thC ° f m ° SC ° f oligonucLotide 
chains. However, because the C bottle contains a small 

' S" £ j G , T« an in — base is sometimes 
added instead. Since the concentration of C is roughly 30 
omes that of A C and T, an incorrect base will £ faddS to 
about 1 out of 30 molecules. This results in a doped collec 
non of ohgonucleotides, which actually consists o P f many 

dons rVfrrP S ° me WM ^ & and SOme Ora- 
tions, The evel of contamination was adjusted ro favor svn- 

tZ S ° f °^° nucI -^es with only one Jubsrimriot but^ 
because subsaruoons occur randomly, some molecules in 
the collection had none and others had two or ^ore Cas- 
settes were formed by annealing complementar^Zed oli- 
gonucleoudes and ligated into .* vector. Plasnu^DKU was 
SeThv ^. ^dividual E coU transforxnants and ana- 
yzed by sequencing. Of these, 224 were wild-type 218 con- 

the 90 possible single substitutions were recovered) and the 
rest contained two or more. A 
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^E^I^ n-H). In gene synthe sis the ex- 
, penrnencer hasTotal. control overrt^c^,^ ^ r 
gene fccan be^vild^ D e. or mutant in \ nv 
_°m£ed. Because most amino acids are encoded by 
multiple tnplet codons, genes encoding wild-type pro- 
teins can be constructed using different. codons Co- 
doas can be chosen to place unique restriction sites 
throughout the sequence so that mutant cassettes can 
be easily swapped in. Tins was done with the bacterial 
rhodopsm gene. Replacing a fragment of the synthetic 
gene with a new synthetic fragment identified the 
amino aad that is linked to the photon-absorbing 
chromophore that initiates photosynthesis. Other frag- 
ments can be exchanged as cassettes to study other 
important structural features of the protein. 

Codons can also be changed by gene synthesis to 
allow production of proteins at high levels in other 
organisms. Studies of the biochemistry of the Fos pro- 
tein, encoded by a cellular protooncogene in animal 
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FIGURE 11-14 

Gene synthesis by ligation of complementary oligonucleo- 
tides. To synthesize a gene that encodes a protein of inter- 
est, a set of overlapping complementary oligonucleotides are 
designed that can be combined to form a clouble-stranded 
DNA molecule that encodes the entire protein. The oligo- 
nucleotides are mixed together, heated at 90°C for a few 
minutes to denature the strands, and then cooled slowly to 
room temperature. During this period the oligonucleotides 
anneal through complementary base pairs. The oligonucleo- 
tides are designed so that each one anneals to two adjacent 
oligonucleotides from the opposite strand, bridging them. 
Generally, oligonucleotides ranging in length from 40 to 80 
nucleotides are used in gene synthesis. The annealed oli- 
gonucleotides are covalently linked by DNA ligase, produc- 
ing two contiguous DNA strands. This synthetic gene is 
usually purified from a gel before ligation into a vector. The 
resultant recombinant plasmid is obtained following trans- 
formation into R coli and is sequenced to check that the 
correct sequence was synthesized. The sequence of the syn- 
thetic gene can be designed to place restriction sites at con- 
venient locations for cassette mutagenesis. 



cells (Chapter 18), have been severely hampered by 
the inability to produce the protein in K coli This 
problem was finally solved by synthesizing a portion 
of the fos gene entirely from oligonucleotides, chang- 
ing natural fos codons to the codons used most effi- 
ciently in E coli Insertion of this synthetic gene into 
an K coli expression vector allowed for the first time 
the production of large quantities of active Fos protein. 
The gene was also designed with several unique re- 
striction sites so that efficient cassette mutagenesis can 
now be coupled to the biochemical assays for Fos 
function. 

The PCR Can Be Used to Construct 
Genes Encoding Chimeric Proteins 

The ease with which mutations can be made in a 
protein coding sequence has revolutionized the study 
of protein function. A functional domain can be iden- 
tified by making a series of mutant proteins, then 
testing which substitutions cause a change in function. 
However, it is not often easy to decide where to make 
a mutation. In the example of the helix swap exper- 
iment (Figure 11-12), the domain that bound DNA 
had been previously identified. And the design of the 
experiment was guided by having a model for the 
three-dimensional structure of the repressor protein. 
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However, for most proteins, little structural infor- 
mation is available. Identifying a functional domain — 
for example,, a region of the protein that may interact 
with another protein — is difficult to do by inspecting 
the primary amino acid sequence. A simple strategy 
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that helps to narrow down important amino acids in 
a protein is the analysis of chimeras between related 
proteins. We have previously discussed the use of 
computer programs to identify related proteins by 
comparison of their amino acid sequences (Chapter 
8). Chimeric proteins are constructed by replacing a 
segment of one protein with the homologous segment 
from another protein. Although the two proteins have 
functional differences, their sequence similarity often 
indicates that they share a common overall structure. 
A striking example of this was in the analysis of human 
growth hormone (hGH). A series of chimeric proteins 
were made in which most of the amino acids were 
derived from hGH but which contained segments from 
related hormones, such as human prolactin. Using this 
strategy, regions of hGH that interact with the hGH 
receptor were identified. In Chapter 17, we* will see 
how functional regions of a receptor which spans the 
membrane seven times were identified by the study 
of chimeras. 

The 434/P22 repressor (Figure 11-12) and hGH 
chimeras were constructed by ligation of short oli- 
gonucleotide cassettes into the coding sequence. A 
different strategy (Figure 11-15) was used to prepare 
a chimeric antibody in which a 400-bp segment from 
a yl heavy-chain gene was replaced by the homolo- 
gous segment from a y2b gene. A 400-bp DNA frag- 
ment was generated by PCR that encoded the new 
sequence to be inserted and two 30-base "sticky feet" 
on each end. The double-stranded PCR fragment was 



heated to denature the two strands, and then one of 
the single-stranded molecules was utilized in a primer- 
extension experiment (as in Figure 11-8). Had the 
gene synthesis method been employed, construction 
of the chimeric gene would have required twenty 40- 
nucleotide-long oligomers. Instead, the sticky feet 
method used only two oligonucleotide primers for 

PCR: 

Mutagenesis Is the Gateway to Gene 
Function and Protein Engineering 

It would be difficult to overestimate the importance 
of in vitro mutagenesis techniques to biology and bio- 
technology. The harnessing of enzymes that operate 
on DNA and the refinement of oligonucleotide syn- 
thesis have made changing gene sequences an almost 
trivial task. And the ability to operate on DNA lets 
us also change the structure of the products of genes — 
RNA and, most importantly, proteins. Thus, the im- 
pact of this technology is twofold. It has revolutionized 
how research is done in molecular biology by creating 
the entirely new concept of 'reverse genetics" — 
changing gene sequence first, then examining gene 
function. And it opens the door to sophisticated protein 
engineering (see Chapter 23), the ability to make 
changes in natural gene products that make them do 
their jobs better. The impact of protein engineering 
on medicine and industry will be substantial. 
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Recombinant DNA 
in Medicine 
and Industry 



s soon as the first successful cloning experiments were reported in 1973, 
applications for this powerful technology quickly followed. The signifi- 
cance of being able to produce large quantities of human proteins that were 
normally available in exceedingly small amounts, if at all, was not lost on 
scientists, physicians, and businessmen alike. In 1976 biotechnology became a 
reality as the methodologies for DNA cloriing, oligonucleotide synthesis, and 
gene expression converged in a single experiment, in which a human protein 
was expressed from recombinant DNA for the first time. The protein was 
somatostatin,- a 14 amino acid peptide neurotransmitter. The gene encoding 
somatostatin was not the natural gene but was synthesized chemically and cloned 
into a plasmid vector for expression in £ coli Soon after followed the successful 
expression of human insulin for the treatment of diabetes, the first commercial 
product of the biotechnology industry. Instead of insulin extracted from the 
pancreases of pigs and cows, diabetics could now receive insulin identical to 
that normally produced by humans. 

The ability to achieve such feats relied on the successes in all areas of molecular 
biology, including oligonucleotide synthesis, isolation of enzymes that cleave 
and join DNA, characterization of bacterial plasmids, and an understanding of 
gene expression. These methods have* of course, revolutionized research in 
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biology and medicine, but what is equally important, 
they have spawned an entirely new industry, one de- 
voted to the cloning and production of proteins of 
importance to both medicine and industry. Today, 
proteins are produced through recombinant DNA 
technology for treatment of numerous diseases— can- 
cer, allergies, autoimmune disease, neurological dis- 
orders, heart attacks, blood disorders, infections, 
wounds, and genetic diseases— as well as for more 
prosaic tasks, such as use in laundry detergents and 
food production. In addition, entirely new approaches 
to drug design have emerged from recombinant DNA 
technology, as scientists have gained the ability to 
tinker with natural proteins to improve their function 
and to change them in subtle and useful ways. 



Expression Systems Are Developed to 
Produce Recombinant Proteins 

Cloning the gene or cDNA encoding a particular pro- 
tein is only the first of many steps needed to produce 
a recombinant protein for medical or industrial use. 
The next step is to put the gene into a host cell for 
production. The development of expression systems 
has been an important research area in both industrial 
and academic laboratories. The most popular expres- 
sion systems are the bacteria R coli and Bacillus subtilis, 
yeast, and cultured insect and mammalian cells. We 
have learned in earlier chapters about the development 
of vectors and DNA transformation methods for these 
organisms. Here we will discuss the issues that are 
important for protein production. The choice of which 
cell is used depends on the project goals and on the 
properties of the protein to be produced. 

Bacterial cells offer simplicity, short generation 
times, and large yields of product with low costs And 
especially with B. subtilis, the cells can be induced to 
secrete the product into the culture medium, thus 
greatly simplifying the task of purification. But expres- 
sion in prokaryotic cells has several drawbacks. Al- 
though some proteins are expressed to" high levels 
(greater than 10 percent of the mass of all bacterial 
proteins), they often fail to fold properly and hence 
form insoluble inclusion bodies. Protein extracted from 
these inclusion bodies is often biologically inactive 



Small proteins can sometimes be refolded into their 
acnve forms, but larger proteins usually cannot A 
second problem is that foreign proteins ate sometimes 
toxic to bacteria, so cell cultures producing the protein 
cannot be grown to high densities. This problem can 
often be circumvented by using an inducible promoter 
• that is turned on to begin transcription of the gene 
for the foreign protein only after the culture has been 
grown. Third, bacterial cells lack . enzymes that are 
present in eukaryotic cells and add posttranslational 
modifications, such as phosphates and sugars, to pro- 
teins. These modifications are often required for 
proper functioning of proteins. Researchers are ad- 
dressing this problem by purifying the eukaryotic en- 
zymes that carry out these modifications and using 
these enzymes to add the needed modifications to 
bactenally expressed proteins. 

Yeast has been used for centuries by brewers and 
bakers, and now it toils for biotechnologists as well 
As discussed in Chapter 13, yeast is a simple eukaryote 
that resembles mammalian cells in many ways but can 
be grown as quickly and cheaply as bacteria can. Yeast 
perform many of the posttranslational modifications 
found on human proteins and can be induced to secrete 
certain proteins into the growth medium for harvest- 
ing. A disadvantage of yeast is the presence of active 
proteases that degrade foreign proteins, thereby re- 
ducing the yield of product. Researchers are dealing 
with this problem, however, by constructing yeast 
strains m which the protease genes have been deleted. 

Expression of heterologous proteins in insect cells 
by baculovirus vectors (as previously described in Fig- 
ure 12-12) is a relatively new approach. The main 
advantages are high-level expression, correct folding 
and posttranslational modifications similar to those in 
mammalian cells. A vaccine for the AIDS virus has 
been prepared by producing one of the HIV glyco- 
proteins with this system. Although the cost of cul- 
tunng msect cells is currently more than that for 
culmnng bacteria and yeast, it is less than that for 
cultunng mammalian cells. 

Despite the significant advantages of producing hu- 
man proteins in heterologous host cells, in some cases 
the best place to produce a mammalian protein is in 
mammalian cells. Great improvements have been 
made to promoters, vectors, transformation protocols 
and host cell systems. Transient expression in mam- 
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malian cells (described in Figure 12-4) is often used 
for checking the function of a newly cloned gene and 
as a quick method for assessing the function of en- 
gineered proteins. The extracellular domains of cell- 
surface receptors (Chapter 17) have been engineered 
for secretion from cells by introducing a stop codon 
into the gene before the transmembrane domain se- 
quence. These soluble receptors are valuable reagents 
for studying ligand binding in vitro and for screening 
for receptor agonists or antagonists, and they may 
eventually be used as therapeutics themselves. Al- 
though transient systems yield enough protein for lab- 
oratory experiments, stably integrated amplified genes 
in mammalian cells are used for the large-scale pro- 
duction of proteins such as tissue plasminogen acti- 
vator, which we describe later. 



Insulin Is the First Recombinant Drug 
Licensed for Human Use 

The fi rst licensed drug produced through genetic en- 
gineering was human insulin. An important hormone 
that regulates sugar metabolism, insulin is produced 
by a small number of cejls in the pancreas and secreted 
into the bloodstream. An inability to produce insulin 
results in diabetes, but daily injections of insulin are 
sufficient to reverse or at least allay the debilitating 
effects of the disease. Prior to production of the re- , 
combinant molecule, insulin for treatment of diabetes 
was obtained from the pancreases of pigs and cows. 
Although this insulin is biologically active in humans, 
the amino acid sequences are not identical to that of 
the human molecule. Thus, some patients produced 
antibodies against injected insulin, occasionally re- 
sulting in serious immune reactions. Because recom- 
binant human insulin is identical to the natural 
product, immunogenicity should not be a problem. 

In mammals, insulin is expressed as a single-chain 
prepro-hormone, which is secreted through the plasma 
membrane. A prepro-hormone contains extra amino 
acids not present in the mature hormone. Amino- 
terminal amino acids form the pre sequence and target 
the expressed protein for secretion. The pro sequence 
is a stretch of amino acids in the middle of the hormone 
sequence that is important for folding the polypeptide 



chain into the correct structure. During secretion, 
these extra amino acids are cleaved from the prepro- 
hormone by cellular proteases to release the mature 
insulin molecule, consisting of two short polypeptide 
chains, A and B, linked by two disulfide bonds. The 
principal challenge in the production of recombinant 
insulin was getting insulin assembled into this mature 
form. The initial approach was to construct synthetic 
genes from oligonucleotides that separately encoded 
the A and B chains. These were individually inserted 
into the £ coli gene encoding yS-galactosidase, so the 
bacteria produced large fusion proteins that had the 
insulin sequences tacked onto the end of the /?- 
galactosidase enzyme (Figure 23-1). These large pro- 
teins were purified from bacterial extracts, and the 
insulin chains were released by treatment with cyano- 
gen bromide, a chemical that cleaves peptide bonds 
following methionine residues. Because a methionine 
codon had been inserted at the boundaries between 
/?-galactosidase and the insulin chains in the fusion 
proteins, cyanogen bromide treatment clipped intact 
insulin chains off the fusion proteins. These were pu- 
rified, mixed, and reconstituted into an active insulin 
molecule. This approach was refined by producing a 
single ^-galactosidase— insulin fusion protein, which 
could be cleaved in a single step to release mature 
insulin. A similar method is now in use for the com- 
mercial production of recombinant insulin. 



Recombinant Human Growth 
Hormone is Produced in 
Bacteria by Two Methods 

Growth hormone is a 191 amino acid protein that is 
produced in the pituitary gland and regulates growth 
and development Children born with growth hor- 
mone deficiency — hypopituitary dwarfs — never 
achieve normal stature. Regular injections of growth 
ho rrnone stimulate the growth of these children so 
that they reach near-normal heights. Unlike the sit- 
uation with insulin, animal-derived growth hormones 
are ineffective. Only the -human protein works, and 
for many years it was painstakingly extracted from the 
pituitaries of human cadavers. One unforeseen and 
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unfortunate consequence of growth hormone treat- 
ment, however, was the infection of a number of chil- 
dren with a fatal virus from one of the cadavers 
Producuon of recombinant human growth hormone 
(hGH) would clearly provide a safe, reliable, and plen- 
tiful source of this drug. 

The initial production of hGH was achieved by 
constructing a hybrid gene from the natural hGH 
cUNA and synthetic oligonucleotides that encoded 
the amino terminus of the mature form of the protein 
figure 23-2a). This coding sequence was ligated into 



FIGURE 23-1 

Expression of human insulin in £ col, Recombinant insulin 
was first made by expressing the A and B chains separately 
then refolding them into a mature insulin molecule A DNA 
fragment encoding each insulin chain was made by anneal ' 
ing two complementary oligonucleorides that had been 
chemtcally synthesized. Each Fragment was ligated into a 
bacterial expression vector so that, when translated the in 
suhn chain would be fused to the carboxy terminus' of the 
enzyme /?-gaIactosidase (/?-gal). The expression vectors 
were transformed into £ coli, and the ^-gal-insulin fusion 
proteins accumulated inside the bacterial cells The cells 
were harvested, and each /J-gal-i nS ulin ^on protein was 
purified. The insulin-coding DNA was synthesized so thar it 
starred with a methionine codon. This serup provided a wav 
to cleave off the /?-gal pm f rom ^ insujpolypepdde " 
Treatment ofthe fiision protein with the chemical cyanogen 
bromide (CNBr) results ,n cleavage of peptide bonds™ 
all meth.onmes. In this way, the natural insulin peptides 
were obtained. Because /?-gal contains other methionine res- 
idues, CNBr treatment cleaved it into many small peptides 
The insulin chains were not cleaved further because they " 
did not contain internal methionines. The A and B chains 
were purified and then mixed together to form active re- 
combinant insulin. 



a plasmid adjacent to a bacterial promoter. Like insulin 
hGH is normally produced as a larger precursor pro- 
tein containing an amino-terminal signal sequence 
Because the human signal sequence would not be rec- 
ognized by the bacterial secretion machinery the 5' 
end of the cDNA was reengineered with a synthetic 
sequence enabling the bacteria to produce a 
nearly normal version of the mature human protein. 

The first hGH expression vectors'directed the pro- 
duction of the protein inside the cell. Purification re- 
quired many steps to separate hGH from the thousands 
of intracellular bacterial proteins. Another way to pro- 
duce the protein in bacteria is to engineer the protein 
so it is secreted. This can be done by linking the coding 
sequence for the desired protein to a signal sequence 
from a secreted bacterial protein, thus forming a pre- 
hormone (Figure 23-2b). Human growth hormone is 
produced by the bacteria and then secreted with the 
concomitant removal of the signal peptide by a bac~ 
tenal protease. Secretion into the periplasm, where 
there are fewer proteins than inside the ceil makes 
purification simpler. The only difference between the 
secreted hGH and that produced intracellular^ is the 
presence of an amino-terminal methionine on the in- 
tracellularly expressed molecule. Because the secreted 
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FIGURE 23-2 

Bacterial production of human growth hormone (hGH). (a) An expression vector was con- 
structed for intracellular production of hGH. The coding sequence was constructed by iso- 
lating from the cDNA a DNA fragment that encoded amino acids 24-191 and ligating this 
to a synthetic oligonucleotide fragment that encoded amino acids 1-24. Following introduc- 
tion of the expression vector into bacterial cells, recombinant hGH was produced inside the 
cells. The expressed protein behaved just like natural human growth hormone but contained 
the initiator methionine at the amino terminus, (b) A protein can be produced in bacteria 
without this extra methionine by targeting it for secretion. To do this, a DNA fragment 
encoding a bacterial signal sequence, which specifies secretion of a bacterial protein, was 
placed in front of the hGH coding sequence. Upon introduction of this vector into bacteria, 
hGH is produced, and the signal sequence targets the protein for secretion. The protein 
accumulates in the periplasmic space between die inner and outer bacterial membranes and 
can be released by hypotonic disruption of the outer membrane. In contrast to the intracellu- 
lar form of hGH, the protein produced by this procedure does not contain an initiator me- 
thionine, since a periplasmic protease cleaved off the signal sequence. 
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•form lacks this methionine, it is called met4ess hGH. 
Bacterially expressed hGH has been administered ro 
Thousands of growth hormone-deficient children, who 
have' benefited greatly from this recombinant drua 




A Hepatitis B Virus Vaccine Is. 
Produced in Yeast by Expression 
of a Viral Surface Antigen 

One of the successes of modern medicine is the de- 
velopment and implementation of vaccines against in- 
fectious diseases. Prior to the advent of recombinant 
DNA technology, two types of vaccines were used. 
Inactivated vaccines are chemically killed derivatives 
of the actual infectious agent. Attenuated vaccines are 
live viruses or bacteria altered so that they no longer 
multiply in the inoculated organism. Both types of 
vaccines work by presenting surface proteins (anti- 
gens) to B and T lymphocytes, which become primed 
to respond rapidly should the organism actually be- 
come infected, usually destroying the infectious agent 
before any damage is done (Chapter 16). However, 
these types of vaccines are potentially dangerous be- 
cause they can be contaminated with infectious or- 
ganisms. F or example, a small number of children each 
year contract polio from their polio vaccinations. Thus, 
one of the most promising applications of recombinant 
DNA technology is the production of subunit vaccines 
consisting solely of the surface protein to which the 
immune system responds. With a subunit vaccine, 
there is no risk of infection. 

The first successful subunit vaccine was produced 
for hepatitis B virus (HBV), which infects the liver 
and causes liver damage and, in some cases, cancer. 
The virus particle is coated with a surface antigen, 
HBsAg, and infected patients carry large aggregates 
of this protein in their blood. Early experiments sug- 
gested that these aggregates' would make a potent 
vaccine, but how could they be produced in quantities 
sufficient to vaccinate large populations against HBV? 
With the cloning of the HBV genome, the possibility 
of a subunit vaccine could be explored. Initial attempts 
to produce the HBsAg protein in £ coli failed, so 
researchers turned to yeast. The HBsAg gene was 
inserted into a high-copy yeast expression vector (Fig- 



ure 13-3) and engineered, in this case, so that it would 
not be secreted (Figure 23-3). Yeast transformed with 
this.plasmid produced large quantities of the viral 
protein (about 1-2 percent of the total yeast protein). 
By growing the yeast in large fermentors, it was pos- 
sible to produce 50-100 mg of the protein per liter 
of culture. This recombinant protein closely resembled 
the natural viral protein; it even formed aggregates 
with properties similar to those of the immunogenic 
aggregates found in infected patients. The yeast pro- 
tein is now used commercially to vaccinate people 
against HBV infection.- 

Vaccines against many human and animal patho- 
gens are currently in various stages of development. 
Recombinant DNA technology has provided a safe 
means to work with and to inoculate children and 
adults with ohly noninfectious parts of infectious 
agents. In Chapter 25, we will discuss various strategies 
for the development of a vaccine against the AIDS 



virus. 



Complex Human Proteins Are 
Produced by Large-Scale 
Mammalian Cell Culture 

Most of the recombinant proteins we have discussed 
thus far in this chapter are relatively small and simple 
in both structure and function. Other proteins of med- 
ical interest are considerably more complicated in 
structure and function, and biologically active proteins 
have proved difficult to produce in bacteria and yeast. 
In these cases, biotechnology companies have resorted 
to using mammalian cells for protein production. 
Mammalian cells are finicky and expensive to grow, 
but they can be counted on to produce correctly mod- 
ified, fully active proteins. Thus, much effort in the 
biotechnology industry has been devoted to setting up 
fermentor systems for large-scale culture of mam- 
malian cells. 

The first drug to be produced commercially by 
mammalian cell culture was tissue plasminogen activator 
or tPA t which is administered to heart attack victims. 
Tissue plasminogen activator is a protease, an enzyme 
that cleaves other proteins. It works by clipping plas- 
■minogen, an inactive precursor protein, to form plasmin, 
itself a potent protease that degrades fibrin, the protein 
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FIGURE 23-3 

Production of a subunk vaccine in yeast. Hepatitis B vims 
(HBV) is encoded by a small 3.2-Icb genome that has been 
cloned and sequenced. Both the whole virus and a smaller 
HBsAg (HBV surface antigen) particle are found in the 
blood of infected patients. To prepare a vaccine against 
HBV, which has been difficult to propagate in culture, the 
HBsAg gene was cloned into a vector for expression in the 
yeast Saccbaromyccs cerevisiae. Transcription occurs from rhe 
strong promoter from the gene encoding alcohol dehydro- 
genase L A transcription terminator was placed downstream. 
The vector contains replication origins and markers for both 
bacteria and yeasL Yeast transformed with this plasmid can 
be grown to high cell densities in fermentors. This process 
results in the accumulation of large amounts of HBsAg 
protein, which upon purification was found to aggregate 
into particles about 20 nanometers in diameter, resembling 
the particles found in HBV-infected patients: 



chat forms blood clots. Rapid administration of a plas- 
minogen activator after a heart attack dissolves the 
life-threatening clots that lead to irreversible damage 
of heart muscle. Tissue plasminogen activator is com- 
mercially produced from a mammalian cell line car- 
rying a stably integrated, highly amplified expression 
vector (Figure 23-4). 

Another protein being produced by mammalian cell 
culture is Factor VIII, a protein required for normal 
clotting of the blood. Genetic defects in Factor VIII 
production are responsible for hemophilia. For many 
years, hemophiliacs have been treated with Factor VIII 
punned from human blood. With- the contamination 
of the human blood supply by the AIDS virus, how- 
ever, thousands of hemophiliacs became infected and 
hundreds died from AIDS. The Factor VIII cDNA 
had already been cloned before scientists found that 
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the blood supply was contaminated with the AIDS 
virus. Recognition of the need for a safer source of 
Factor VIII accelerated efforts already under way to 
produce the protein using recombinant DNA methods 
Like tPA, Factor VIII is a large and complex protein 
and can only be efficiently produced in mammalian 
ce : culture. But the a vailabihty of recombinant protein 
will spare future generations of hemophiliacs from 
infectious agents that contaminate the blood supply. 



Monoclonal Antibodies Functi 
as "Magic Bullets" 



on 



We have d,scussed the use of biotechnology to pro- 
duce novel vaccines that elicit antibody production 
by the body's immune system. As we learned in Chap- 
ter 16, antibodies are exquisitely selective proteins that 
can bind to a single target among millions of irrelevant 
sites. Researchers have long dreamed of harnessing the 
specificity of antibodies for a variety of uses that re- 
quire the targeting of drugs and other treatments to 
particular sites in the body. It is this use of antibodies 
as targeting devices that led to the concept of the 
magic bullet," a treatment that could efFectively seek 
and destroy tumor cells and infectious agents wherever 
they resided. 

The major limitation in the therapeutic use of an- 
tibodies is producing a useful antibody in large quan- 
tities. Initially, researchers screened myelomas, which 
are antibody-secreting tumors, for the production of 
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FIGURE 23-5 

Production of a monoclonal antibody (MAb). A mouse is 
inoculated with an antigen (Ag) of interest. This stimulates 
the proliferation of lymphocytes expressing antibodies 
against the antigen. Lymphocytes are taken from the spleen 
and fused to myeloma cells by treatment with polyethylene 
glycol. Hybrid cells are selected by growth in HAT medium 
(Chapter 12). The myeloma cells lack the enzyme HPRT 
and thus die in this medium unless they become fused wirh 
a lymphocyte, which expresses the missing enzyme. Unfused 
lymphocyte cells soon die off as well, because they do not 
grow for long in culture. Individual hybrid cells are 
transferred to the wells of a microliter dish and culrured 
for several days. Aliquots of the culture fluids are removed 
and tested for the presence of antibody (Ab) that binds the 
antigen. Cells that test positive are cultured for monoclonal 
antibody production. Antibody-producing cell lines are 
stored frozen in liquid nitrogen (this process is called cell 
banking Aliquots can be chawed out and cultured as needed. 



is inoculated with the antigen to which an antibody 
is desired. After the animal mounts an immune re- 
sponse to the antigen, its spleen, which houses anti- 
body-producing cells (lymphocytes), is removed, and 
the spleen cells are fused en masse to a specialized 
myeloma cell line that no longer produces an antibody 
of its own. The resulting fused cells, or hybridomas, 
retain properties of both parents. They grow contin- 
uously and rapidly in culture like the myeloma cell, 
yet they produce antibodies specified by the lympho- 
cyte from the immunized animal. Hundreds of hy- 
bridomas can be produced from a single fusion 
experiment, and they are systematically screened to 
identify those producing large amounts of a desired 
antibody. Once identified, this antibody is available in 
limitless quantities. Monoclonal antibodies are already 
widely used for the diagnosis of infections and cancer 
and for the imaging of tumors for radiotherapy. And 
investigations into their use in the direct treatment of 
cancer, inflammation, and immune disorders is on the 
rise. 



ium 



useful antibodies. But they lacked a means to program 
a myeloma to produce an antibody to their specifi- 
cations. This situation changed dramatically with the 
development of monoclonal antibody technology. The 
procedure for producing monoclonal antibodies, or 
xVlAbs, is shown in Figure 23-5. First, a mouse or rat 



Human Antibodies That Recognize 
Specific Antigens Can Be Directly 
Cloned and Selected 

One new application of monoclonal antibody tech- 
nology is the generation of abzymes, antibodies that 
behave like enzymes to catalyze a chemical reaction. 



FIGURE 23-6 

Direa cloning of antibody cDNAs by PCR. To engineer ^ 
antibody, the ammo acid sequence of the variable domain 
needs to be determined. This could be done by sequencing 
a punned preparation of the heavy- (H) and light- (L) chain 
proteins, but a simpler method is to deduce the sequence 
from the cloned cDNA. In the past, a cDNA library was 
prepared from hybridoma mRNA and screened with probes 
irom the constant regions of the H and L chain genes A 
simpler method has been developed that uses the PCR 
From a comparison of a large number of antibody 
sequences, amino acids frequently found at the amino 
termini of antibodies were identified. From this information 
a set of degenerate PCR primers was designed chat cor- ' 
respond to all the possible sequences in this region. Because 
the amino acids in the constant domains of different anti- 
bodies are nearly identical, only one PCR primer is needed 
for the > end of each H and L chain sequence. To directlv 
clone the antibody cDNAs, cDNA is prepared by treating ' 
hybridoma mRNA with reverse transcriptase, mixed with , 
pair of PCR primers (in this case, for amplifying the 
heavy chain sequences), and subjected to PCR Without 
knowledge bf the amino terminus of the antibody chain a 

ff had to be set U P «^th each of the different 5' primers 
until an amplified DNA fragment was obtained. The pro- 
cess can be simplified if the sequence of the first six or 
seven amino acids of the antibody can be determined this 
is sufficient to design a single 5' PCR primer 



Enzymes catalyze reactions by stabilizing a chermcal 
structure intermediate between the substrate and 
product, termed the transition state. Thus, if monoclonal 
antibodies could be made to a transition state ana- 
logue—a molecule resembling the transition state of 
a chemical reaction— then some of these antibodies 
might have catalytic activity. The ability to produce 
custom-des lg ned catalysts would be very valuable es- 
pecially to the chemical and pharmaceutical industries 
Initial attempts to produce catalytic antibodies in- 
dicated that they were exceedingly rare and often not 
found among the hybridomas produced by conven- 
tional monoclonal antibody technology. An excellent 
fusion might produce several hundred different an- 
tibodies, but the entire repertoire of antibodies that 
can be produced by the immune system is perhaps 
100 million. How can the entire repertoire be tapped^ 
One strategy, that shows promise is to bypass the 
inefficient fusion step in hybridoma production and 
directly clone antibody cDNAs from the lymphocytes 
of immunized mice (Figures 23-6 and 23-7). Inves- 
tigators inoculated a mouse with an antigen. They 
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recovered spleen cells from the mouse and used PCR 
to amplify millions of cDNAs for antibody light and 
heavy chains. The light- and heavy-chain cDNAs were 
cloned separately into phage vectors and then recom- 
bined m vitro to generate a third, combinatorial library 
of phage carrying random pairs of light and heavy 
chains. The library was plated onto a bacterial lawn 
and the resulting phage plaques, each containing a' 
unique antibody, were screened with radioactively la- 
beled antigen in a manner similar to that used for 
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FIGURE 23-7 

Creating a combinatorial library of antibodies expressed in 
. £ coli. Lymphocytes, or spleen cells are removed from an 
immunized animal. mRNA is obcained and cDNA is synthe- 
sized with reverse transcriptase. The heavy- (H) and "liahr- 
(L) chain genes are separately amplified by PCR, as shown 
in Figure 23-6, and ligated inco X cloning vectors. Two dif- 
ferent libraries are produced, one containing the H chain 
genes and one containing rhe L chain genes (this step has 
been omitted from the figure for simplicity). Phage DNA is 
isolated from each library, and rhe H and L chain sequences 
are ligated together and packaged to form a combinatorial 
library. Each phage now contains a random pair of H and 
L chain cDNAs and thus upon infection of £ coli directs 
the expression of the two antibody chains in infected cells. 
Since the H chain sequence contains only the variable 
region and the first constant domain, the antibody that 
forms is called a Fab, for antigen binding fragment. It binds 
the antigen much like an intact antibody but it lacks the 
effector domain. To identify- an antibody that recognizes the 
antigen, the phage library is plated, and the antibody (Fab) 
molecules present in the plaques are transferred to filters. 
The filters are incubated with radioactively labeled antigen 
and then washed to remove excess unbound ligand. A 
radioactive spot on the autoradiogram identifies a plaque 
that contains an antibody that binds the antigen. A recent 
procedure uses the phage display technology, described in 
Figure 23-10, to select antibodies with desired properties. 
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clohing.-eDNAs from an expression library (Figure 
7-10). Out of a million phage plaques screened, 200 
clonp were identified that produced an antibody bind- 
ing the antigen. Thus, with this approach, investigators 
were able to sample a million possible antibodies— 
at least a thousand times more than they could screen 
by. conventional monoclonal antibody technology. 
Since phages in a particular plaque encode the anti- 
body expressed in the plaque, it is a trivial matter to 
clone the heavy- and light-chain cDNAs from the 
phage DNA. These cDNAs can be placed into bac- 
terial or mammalian expression vectors for production 
of large quantities of the selected antibody. 

A recent modification of this method uses filamen- 
tous phages such as Ml 3 instead of X phage and allows 
display of the antibodies on the phage surface. This 
offers the advantage of being able to screen thousands 
more phage (because the screening can.be done. in 
solution) and to select phage that express tight-binding 
antibodies. We will discuss this method later and in 
Figure 23-10. 
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Humanized" Monoclonal Antibodies 
Retain Activity But Lose 
Immxinogenicity 

Although swift progress is being made in the identi- 
fication of monoclonal antibodies with potential ther- 
apeutic value, their use is limited by a problem we 
have already discussed in this chapter. Monoclonal 
antibodies are usually mouse proteins, and they are 
not identical to human antibodies. Thus, antibodies 
injected into a patient will eventually be recognized 
as foreign proteins and will be cleared from the 
circulation. 

As we learned in Chapter- 16, both chains of the 
antibody molecule can be divided into variable and 
constant regions. The variable regions differ in se- 
quence from one antibody to another, and this is the 
region of the protein that binds the antigen. The con- 
stant region is the same among all antibodies of the 
same type. The first method used to reduce the im- 
munogenicity. of a mouse monoclonal antibody was 
simply to construct 'chimeric genes that encoded pro- 
teins in which the variable regions from the mouse 
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Antibody engineering. The basic structure of a mouse mono 
clonal antibody (MAb) resembles that of a human antibody 
However, there are numerous differences between amino 
acid sequences of the antibodies from the two species 
These sequence differences account for the immunogenicity 
of mouse MAbs in humans. A chimeric MAb is constructed 
by hgating the cDNA fragment encoding the mouse V L and 
y H domams to fragments encoding the C domains from a 
human antibody. Because the C domains do not contribute 
to anugen binding, the chimeric antibody will retain the 
same anugen specificity as the original mouse MAb but will 
be closer to human antibodies in sequence. Chimeric MAbs 
sail contains some mouse sequences, however, and may still 
be immunogenic A humanized MAb contains only chose 
mouse amino acids necessary to recognize the antigen 
This product is constructed by building into a human 
antibody the amino acids from the mouse complementarity 
determining regions or CDRs. 



antibody were fiised to the constant regions from a 
human antibody. The chimeric antibody (Figure 
23-8) retained its binding specificity but more closely 
resembled a natural human antibody. 

This antibody, however, was not fully humanized, 
because it retained amino acid sequences from the 
mouse protein. Thus, scientists have set out to engi- 
neer fully humanized monoclonal antibodies that will 
be indistinguishable from natural molecules. Extensive 
studies of the three-dimensional structures of antibody 
molecules tell us that only a few of the one hundred 
amino acids in the variable region of an antibody 
actually contact the antigen; these regions of contact 
are referred to as complementarity determining regions 
(CDRs). Three CDRs each comprise the antigen- 
binding sites on the light and heavy chains. The rest 



of the variable region serves as a scaffold to anchor 
che CDRs in rhe correct positions. This breakdown 
of amino acids in che variable region into those serving 
recognition and those serving structural roles is also 
evident from simply comparing the sequences of many 
antibody molecules. Amino acid sequences in the 
CDRs are hypervariable, whereas the structural, or 
framework, amino acids differ little. 

Thus, to make a fully humanized antibody, all that 
would be required in principle would be to use in 
vitro mutagenesis to transfer the CDR amino acid 
sequences from a mouse MAb to a natural human 
antibody (Figure 23-8). This method was used to hu- 
manize an antibody that recognizes an antigen on the 
surface of human lymphocytes. This humanized MAb 
is now in clinical trials as an immunosuppressant and 
for treatment of lymphoid tumors. Another potentially 
valuable MAb binds a growth factor receptor found 
in large numbers on the surface of many breast tumor 
cells. Laboratory experiments showed that this anti- 
body could block the growth of these cells in culture 
and caused tumors seeded in mice to regress. Unfor- 
tunately, the first humanized versions of this antibody 
bound the receptor protein but failed to block the 
growth of breast carcinoma cells. Investigators sus- 
pected that the problem was with the framework amino 
acids, and they used computer modeling to design 
amino acid substitutions that would strengthen the 
antibody-antigen interaction. Several such variant an- 
tibodies were produced and tested; one bound the 
receptor 250 times more tighdy than did the original 
antibody and successfully blocked tumor cell growth 
in culture. This antibody is now being produced in 
large quantities for clinical trials. 



Protein Engineering Can Tailor 
Antibodies for Specific Applications 

Humanizing monoclonal antibodies is an example of 
the emerging technology of protein engineering, that is, 
a process using recombinant DNA to modify the struc- 
ture of natural proteins to improve or change their 
function. Antibodies are particularly attractive Can- 
dida tes for protein engineering, because their structure 



is understood in great detail and because their poten- 
tial for use in medicine is enormous. Another way in 
which antibodies are being engineered is by changing 
their effector domains, the regions of the heavy chain 
that specify antibody function — for example, killing 
of cells marked by the antibody. In this way, the mode 
of action of a monoclonal antibody can be repro- 
grammed. One promising strategy is to replace the 
effector domain entirely with a sequence encoding* a 
toxin. An antibody-toxin fusion protein would deliver 
the toxin specifically to cells bearing the target antigen. 
This product could be an exceptionally potent treat- 
ment for cancer and for viral diseases such as AIDS. 
Antibody engineering is also being used to construct 
bispecific antibodies. In these antibodies, each of the two 
arms recognizes a different antigen, thus allowing an 
antibody to bridge the two antigens. For example, a 
bispecific antibody could recognize a turnor cell pro- 
tein with one arm and a protein on the surface of a 
killer T cell with the other, thereby bringing the killer 
cells directly to the tumor (Figure 23-9). 



Protein Engineering Is Used to 
Improve a Detergent Enzyme 

Subtilisin is a serine protease produced by bacteria. 
Due to its broad specificity for proteins that commonly 
soil clothing, this enzyme was developed for com- 
mercial use in laundry detergents. (It is subtilisin that 
is prominently advertised as the enzyme additive in 
modern detergents.) But the first detergents containing 
subtilisin suffered from a serious drawback: they could 
not be used with bleach, because bleach inactivates 
the enzyme. Biochemical analysis determined that loss 
of activity was due to the oxidation of a methionine 
at position 222. Once this happened, the modified 
enzyme lost 90 percent of its activity. Because they 
knew which amino acid was bleach sensitive, however, 
scientists decided to see whether a variant of subtilisin 
could be produced that was no longer sensitive to 
bleach. 

To do this, site-directed mutants were constructed 
in the gene encoding subtilisin. The strategy was sim- 
ply to substitute, one at a time, each of the non— wild- 
type amino acids at residue 222. The mutant scenes 
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FIGURE 23-9 

A bispecific antibody. By using recombinant DNA the 
cDNAs for antibodies to two different antigens can be engi- 
neered to make an antibody in which each arm recognizes a 
.different antigen. Thus it is possible to recombine antibodies 
to surface antigen on tumor cells and to a protein on cyto- 
toxic T cells to make a bispecific andbody that brings the 
two cells together to facilitate killing of the tumor cells 



were cloned into expression vectors and the 19 dif- 
ferent subtilisin derivatives were expressed. Biochem- 
ical analysis showed chat the cysteine-222 enzyme was 



even more active rhan che wild-type protein, but it 
was also inactivated by bleach. The next most active 
variant was the alanine-substiruted enzyme, which was 
53 percent as active as wild-type subtilisin. This vari- 
ant exhibited no detectable bleach sensitivity, so de- 
tergents containing this engineered subtilisin can now 
be used with bleach. This new variant of subtilisin ls 
an example of a second-generation molecule, a molecule 
specifically engineered for a new desirable trait. Pro- 
tein engineers are currently at work on a third-gen- 
eration molecule that exhibits decreased temperature 
sensitivity so that it can be used in hot water. 
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Growth Hormone Variants 
with Improved Binding Are 
Selected by Phage Display 

To engineer an improved subtilisin enzyme, research- 
ers were aided by the knowledge that only one specific 
amino acid had to be changed. Thus, they could sys- 
tematically vary that amino acid to find the one that 
worked the best. But more complex challenges face 
protein engineers. Is it possible, for example* to en- 
gineer antibodies with higher affinity for antigen; to 
design an inhibitor that tightly binds to and blocks a 
cell-surface protein or an enzyme inside a cell; to 
generate a growth factor or hormone with increased 
affinity for its receptor? Alterations of this sort require 
several amino acid changes, and with 20 possible amino 
acids at each position, the number of variants that 
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need to be screened is enormous (for changes at just 
3 amino acids, there are 8000 different combinations; 
for 10 amino acids, 10 13 different proteins are possible). 
Clearly, these variants cannot be made and tested one 
at a time, and a method for direct selection of improved 
proteins is needed. 

Researchers have used, a new approach to select 
variants of human growth hormone with increased 
affinity for growth hormone receptor (Figure 23-10). 




Medicine and Indu 



stry 



467 



FIGURE 23-10 

Expression of proteins and peptides on the surface of 
filamentous phage. A library of randomly mutated hGH 
cDNAs was ligated into an Ml3-based phagemid vector so 
that hGH was fused to the carboxy-terminal domain of the 
Ml 3 gene III protein. The carboxy terminus of the gene 111 
protein associates with the phage particle, and the amino 
terminus, containing the hGH variants, is displayed on the 
outer surface of the phage. The library of phagemids is 
introduced into £ coli t and ampicillin-resistant colonies are 
obtained. These £ colt are then infected with a helper phage 
that induces the production of phagemid particles. Only 1- 
10 percent of the phage particles contain an hGH-gene ill 
fusion protein, and these contain only one hGH fusion 
molecule per phage. This ensures that the phage retain 
sufficient wild-type gene III protein in their coats to 
remain infectious. hGH-phage were passed through a 
column containing the hGH receptor covalently linked to 
plastic beads. Only the phage expressing hGH were re- 
tained. The nonbinding phage lacking hGH passed through 
the column. The bound phage were isolated, cultured in 
£ coli, and. passed, again over the column. Repeated rounds 
of selection resulted in the identification of hGH variants 
that bound the receptor with exceptionally high affinity. 



From structural studies and extensive mutagenesis of 
hGH, they knew what portions of the amino acid 
sequence were important for receptor binding. They 
synthesized degenerate oligonucleotides that encoded 
all possible amino acids at these positions and ligated 
the pool of oligonucleotides in place of the natural 
hGH sequence. The resulting pool of variant hGH 
cDNAs was fused to the reading frame of gene III in 
the filamentous phage Ml 3. Gene III encodes a minor 
phage coat protein expressed on the surface of the 
phage, and incorporation of the hGH cDNA into this 
gene results in the display of the hGH variants on the 
surface of the phage, one variant per phage. This tech- 
nique is known as phage display. 

Now it was a simple matter to pass this library of 
more than 10 1 ? different phage over a column con- 
taining the hGH receptor. Phage displaying weakly 
binding hGH variants were washed off the column, 
and phage displaying tightly binding variants were 
recovered with a more stringent wash. This population 
of tight-binding phage was amplified by infection of 
£ coli and passed over the column a second time The 
selection was repeated for a. total of six rounds, each 
round enriching for the phage displaying hGH variants 
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,. with highest affinity for the recepcor bound to the 
column. At this point, individual phage were cloned, 
th.e affinities of their .hGH variants were measured 
directly, and the sequences of the hGH cDNAs were 
examined. Among these variants was one that bound 
its receptor about 10 times more tightly than natural 
hGH did. When selected amino acids from another 
region of hGH that had been randomized were in- 
troduced into this variant, the resulting hGH molecule ■ 
bound to the hGH receptor over 50 times more tightly 
than the wild-type hGH did. This process is being 
repeated in the hope to obtain even more tightly bind- 
ing variants. 

The ability afforded by techniques such as phage 
display to correlate protein structure and function in 
a systematic way makes possible new methods of find- 
ing novel drugs. If researchers have a good idea what 
combination of amino acids gives the best fit to the 
binding site on a receptor, the next step in rational 
drug design would be to design, or even select, a small 
peptide that binds as well as the larger protein. And 
then, using computer modeling to display the molec- 
ular contacts between ligand and receptor, researchers 
can attempt to design and synthesize small nonprotein 
molecules that make the same contacts. The end- 
product would be a small organic molecule that could 
be produced more cheaply than a recombinant protein, 
yet would retain the full biological activity of the 
protein hormone. And, more important, such mole- 
cules could be administered orally, thus eliminating . 
the major disadvantage of most recombinant protein 
therapeutics— that they must be delivered directly 
into the bloodstream by injection. This type of rational 
drug design contrasts sharply with the conventional 
approach to drug discovery now in use in the phar- 
maceutical industry, in which an inventory of com- 
pletely unrelated compounds is tested at random until 
ah active compound is found. 



New Technologies Promise New 
Approaches to Drug Design 

The biotechnology industry is in its infancy, and it 
successes to date follow directly from development 
in molecular biology that are already nearly two de- 
cades old. The recombinant drugs currently in clinicai 
use arise from what is by now conventional technol- 
ogy—gene cloning, expression, and mutagenesis to 
improve protein function. These methods will con- 
tinue to turn out new drugs such as erythropoietins 
to treat anemia caused by kidney disease, DNase to 
treat cystic fibrosis, or colony-stimulating factors 
(CSFs) to increase white blood cell production during 
chemotherapy. 

But the true promise of biotechnology is in novel 
technologies that are only now being developed. We 
have mentioned efforts to design catalytic antibodies 
that can accelerate chemical reactions in both medical 
and industrial applications. This is but one example 
of a whole new approach to protein engineering in 
which novel activities can be placed on unrelated pro- 
tein scaffolds, using random mutagenesis coupled with 
selection methods like phage display. Similar goals 
may be achieved by the engineering otribozymes, RNA 
molecules with catalytic activity, and' the use of the 
polymerase chain reaction to select nucleic acid mol- 
ecules that bind tightly to targets of medical impor- 
tance. Another strategy that may see widespread 
application is treatment with antisense DNA and RNA 
to inhibit the expression of oncogenes in tumors or of 
viral genes in infected patients. And a variety of new 
technologies based on viral vectors promise new ap- 
proaches for vaccines and gene therapy. 

Many of these techniques now work in the test 
tube, and the principal challenge facing biotechnology 
companies is to turn these laboratory techniques into 
commercially viable processes^ 
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